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AN OUTLOOK REPORT* 


WiuiiaM R. LEonarD 
United Nations 


AST year your President spoke on the subject of “Statistical Frontiers” and 
[ of an ever-expanding statistical universe with bright and constantly mov- 
ing horizons. The universe contained three continents—descriptive methods, 
design of experiments and investigations, and analysis and theory.’ Tonight I 
should like to explore further some aspects of descriptive statistics, from the 
point of view of their meaning and substance, rather than from the point of 
view of method, with particular reference to the scope and magnitude of the 
work that lies ahead. 

As Miss Cox said, descriptive statistics are statistics in their oldest and most 
traditional form; they refer to problems of the state. It is true that statistical 
methods are now finding decisive applications in almost all branches of human 
activity, but this by no means diminishes the importance of their applications 
to economic and social fields. As Professor Maurice Kendall said in his inaugural 
lecture at the London School of Economics in 1950: “It remains true, I think, 
that the work and welfare of mankind provide the most important domain for 
statistical inquiry.”? ‘ 

The scope of economic and social statistics, on an international scale, is al- 
ready large and it is growing. If we assume that world population, now esti- 
mated at 2.7 billion persons, should continue to grow at its current rate of in- 
crease, total population will have doubled by the year 2000—only 43 years from 
now. If we assume that the world will continue to produce goods and services 
at the rate which obtained from 1950 to the present, the world’s product or in- 
come by the year 2000 will be nearly four times its present size. 

The statistical requirements which will be imposed by increases of these 
magnitudes will be very large indeed but these rates of increase do not repre- 
sent the real challenge faced by economic and social statisticians. The economic 
and social world is not only growing larger but it is becoming vastly more com- 
plex. Dynamic changes of great consequence are occuring or impending both in 
the highly industrialized countries of the West and in the now largely agrarian 
countries of the Far East and Africa. 

The common market proposals now being discussed, in Europe and else- 
where, if instituted, would have very far-reaching implications concerning the 





* Presidential address, at the 117th Annual Meeting of the American Statistical Association, Atlantic City 
New Jersey, 12 September, 1957. 

1 Journal of the American Statistical Association, 52 (1957), 1-12. 
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organization of production and the capital requirements needed to give effect to 
different production and trading patterns. The economic development pro- 
grams now being undertaken by many non-industrialized countries have similar 
implications. The magnitude of the latter problem can be sensed if we realize 
that some two-thirds of the world’s people live in under-developed countries 
and receive only about one-third of the world’s income. This kind of compari- 
son is the more striking when we realize that the roughly six per cent of world 
population living in the United States receive over 40 per cent of world income. 
If the under-developed countries are to become industrialized in the Western 
sense, enormous quantities of plant and equipment, transport facilities and 
indeed all types of productive capital must be created. From the statistical 
point of view, it will be necessary for the statisticians to develop statistical 
services to indicate the nature and size of the capital investment required. 

Increased industrialization must have as basic requirements the develop- 
ment of suitable analytical methods and the availability of bodies of statistics 
to describe the problems and their relationships, to interpret the past and to 
plan the future. These responsibilities devolve upon the statistician, and in 
fact, he must anticipate them so as to avoid the waste and frustration which 
attend decisions based upon ignorance and supposition. Whatever the rates of 
population growth and of economic development turn out to be, there will be 
multipliers of considerable size, as far as the statistical requirements ure con- 
cerned, introduced by the expanding complexity of our economic and social en- 
vironment and by the inherent propensity toward change. 

Thus, vast new areas for the application of statistics in the economic and 
social field are emerging. In addition, large gaps in basic economic and social 
statistics remain to be filled even if we are considering simply the more or less 
conventional measures. For example, in only a few countries can we find size 
distributions of the ownership of land; for most of the world’s populations there 
are no systematic sources indicating changes in rates of growth of the popula- 
tion, the incidence of the common diseases, the disposal of resources among 
different uses and among different economic classes. To fill the gaps, of course, 
is of outstanding importance but in addition I believe there are urgent reasons 
to re-examine the methods and objectives of some of the traditional statistical 
inquiries. 

There is some tendency for economic and social statisticians to accept with- 
out much question the concepts and rubrics, particularly those of the industrial- 
ized countries, which have been handed down through several generations of 
statisticians. We have seen many instances where attempts have been made to 
transplant the statistical systems of industrialized countries to less well-de- 
veloped countries. In most cases it was found that the attempt was not satis- 
factory. This, of course, is not an unexpected result because the concepts and 
definitions adapted to one type of economy would not necessarily be applicable 
elsewhere. At the same time, questions also arise as to the applicability of some 
of the traditional concepts to the changed and changing nature of the indus- 
trialized societies themselves. In suggesting that the methods and objectives of 
the traditional inquiries be re-examined, I do not mean that the conventional 
forms must be discarded. These must be strongly supported until new ones are 
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developed and tested. All I am saying is that the statistical apparatus should 
be under continuous investigation to be sure that it is performing efficiently © 
both as to objectives and as to cost and that it should be capable of reflecting 
the changing nature of the economic and social structure. 

Let us examine briefly two areas of economic and social statistics—one old 
and one new—about which new thinking may be required. The ‘census of popu- 
lation is perhaps the oldest statistical activity of states. Its origin is lost in 
antiquity—the Book of Numbers gives an account of the census of the tribes of 
Israel while in Roman times the census was already a regular tool of adminis- 
tration, used for the purposes of taxation and military conscription, the two 
main functions of the state at that time. 

In the 18th and 19th centuries census-taking flourished, particularly in the 
Americas and Western Europe. It was, of course, the principal statistical in- 
strument at a time when the industrial revolution was having a profound effect 
upon the economic and social organization of society. It is true there were some 
other bodies of statistics, such as external trade statistics, but these were partly 
at least by-products of administration. These new bodies of statistics resulted 
from the growing concept, and it is surprising how new that concept is, which 
regarded the state as an economic entity as well as a political entity for which 
economic policies and an economic climate had to be established. 

The population census received early international attention by the Inter- 
national Statistical Congress, which met nine times between 1853 and 1876, and 
by the International Statistical Institute, founded in 1885. At the 5th Inter- 
national Statistical Congress, meeting in St. Petersburg in 1872, agreement was 
reached upon a standard list of questions to be recommended to national census 
authorities. The astonishing fact is that the list is remarkably like the list used 
today in many countries. It included name, age, sex, marital status, occupation 
and economic status—i.e., whether the person was an employer or an employee. 
These categories describing economic characteristics, with modest expansions 
from time to time, have become pretty much hallowed by usage, and I believe 
there is some tendency for them to be regarded as adequate to distinguish the 
important characteristics of the population. A more critical view, however, 
might result in numerous questions. The economic status classification, for 
example, as applied to the economically active population was no doubt ad- 
mirably suited to the circumstances prevailing during the height of the indus- 
trial revolution. In the age of the factory—and of the craft union—the dis- 
tinctions between one industry and another and between employer and em- 
ployee were abundantly clear and, what is perhaps more important, the ways of 
life of the members of the various crafts and industries were so different that 
they were in fact different social groups, the characteristics of which were 
worth distinguishing and investigating for the sake of the information they 
provided about the underlying nature of the society of which they were part. 
This contrasts sharply with the situation today when the plant worker may, at 
some time during the year, become an own account worker or possibly an em- 
ployer. The description of his economic status therefore depends upon the 
time of year the census is taken. There is also the question of identifying em- 
ployers in a large corporate enterprise—the stockholders? The general manager? 
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The foremen who have the power to hire and fire but in fact are employees? 
It is obvious that considerably elaborated sub-categories within each main item 
would be necessary to clarify the meaning of the figures. But such elaboration 
is most certain to be costly and does not entirely dispose of the main question 
which concerns the purpose of the classification itself. 

What characteristics are these data expected to reveal today? How will we 
use them either by themselves or when cross-classified with other demographic 
or economic characteristics? For purposes of this discussion, I am leaving aside 
questions of enumerator and response bias which are likely to be extremely con- 
sequential in connection with questions concerning rather vague and variable 
characteristics. The difficulties arising in connection with the status classifica- 
tion, and some doubt about its usefulness in any case, are so numerous that 
several national census offices are considering either abandoning the classifica- 
tion altogether or devising something entirely different. Even in 1901, one com- 
mentator in referring to the 1891 census of Great Britain said: “The attempt 
made to distinguish between ‘employer’ and ‘employee’ was admittedly not a 
success nor de I think any question can be framed which will serve to secure a 
categorical answer in every case.”* But again, what is it we really require of a 
comprehensive census of population, with involved classifications set up to de- 
scribe economic characteristics, conducted once in ten years, producing tabu- 
lated statistics one, two, three or more years later? 

The original idea of the classification was clearly to establish the numbers of 
persons in each class of worker group to allow certain inferences to be made as to 
levels of economic and social well-being. It is clear that under present condi- 
tions, either in the industrialized or in the agricultural economies, this exercise 
is not likely to come very close to the mark. I have no specific proposal to make 
at this time but perhaps what is needed is a device which will classify the eco- 
nomically active population in such a way as to show more directly whether 
their capacity to command economic goods and services and to attain the more 
intangible social benefits is increasing or not. I am not certain that this is the 
right direction at all but I am certain that a great deal of thought will be given 
to this question in the next few years both by the national census offices and by 
the international agencies which are attempting to develop useful and accept- 
able standards for statistical work in this field. Discussion is already going for- 
ward, and several countries are expecting to conduct field tests of different 
concepts. 

There is at least one other serious question about the classic approach to 
population censuses, and this concerns the occupational classification. I am 
told by my colleagues that this classification is invaluable. It provides an essen- 
tial base for rates of occupational morbidity and mortality; it can be used for 
production planning—as in war-time—and for occupational guidance; it is a 
basis for inferring something about the general economic well-being of the eco- 
nomically active population and when cross-tabulated with the economic status 
classification, produces a socio-professional (or occupational) grouping which 
in some countries has considerable meaning; the occupational classification 
can be used to formulate and evaluate broad social programs such as education. 





* Reginald Dudfield, “Census Taking,” Journal of the Institute of Actuaries, XXXV (1901). 
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But it may be, however, that statisticians are looking for the wrong things in 
population censuses. Are we trying to study such things as industrial or occupa- 
tional morbidity in small but vaguely defined groups and failing to study more 
general problems of radiation, air pollution and smoking? Is it possible to pro- 
duce sufficiently exact occupational data by census enumeration for production 
and occupational planning without undue delay? Perhaps well planned sample 
surveys of occupations, or special registrations in times of emergency, would 
produce better statistics for these purposes. 

The occupational classification, like the status classification, is of a respect- 
able age. It was first recommended for international use by the International 
Statistical Institute in 1892, having been drawn up by Bertillon at the request 
of the Institute.‘ There were 12 main headings: 


Production of Raw Materials 


I. Exploitation of land (including fishing and hunting) 
II. Extraction of minerals 


Transformation of raw materials 


III. Industry 
IV. Transport 
V. Commerce 


Public administration and the liberal professions 


VI. Military and police 
VII. Public administration 
VIII. Liberal professions 
IX. Rentiers 


Miscellaneous 


X. Domestic servants 
XI. Without employment 
XII. Unproductive and unknown 


This classification was used without major changes in many subsequent cen- 
suses and is recognizable today in the International Standard Classification of 
Occupations only recently established by the International Labour Office. In 
the latest classification, at least six of its eleven main categories are very similar 
indeed to the 1892 list. It is clear of course that such broad groups can have 
only very general meaning and are not of particular value for some of the spe- 
cific purposes mentioned above. The same is true of the 9 major groups of the 
International Standard Industriai Classification. To a certain extent the two 
classifications, in their broadest groupings, distinguish the same things, i.e., 
persons working in an agricultural establishment (the industrial classification) 
and farmers (the occupational classification). 

More specific uses, therefore, would require greatly elaborated detail even 
unto the third and fourth digit (in one under-developed country the occupa- 
tional classification scheme, used for population censuses, contains 2,500 titles). 
Any extended classification immediately imposes very considerable operational 
problems. The increased number of groups probably increases enumerator and 
response bias and without question greatly increases tabulation and publica- 





‘4 Bulletin, Institut International de Statistique, Tome VI, 1892. 
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tion problems which of all census operations seem to cause the greatest diffi- 
culty. The tabulation problem is a very real one. Even disregarding smali area 
tabulations—but these are almost universal—full exploitation of the occupa- 
tional classification requires cross-tabulation by industry, by class of worker, 
and by sex. Given the very minimum cross-tabulations, where only the one- 
digit categories of occupation and industry are used, some 800 cells, disregard- 
ing small area tabulations, form the basic table. Few countries confine them- 
selves to the minimum, although some restraint is exercised in the two- and 
three-digit cross-tabulations by occupation and industry. But usually many of 
the cells emerge rather sparsely inhabited. I find, for example, that on 1 April 
1950, there were 5 female plumbers in Maryland and 9 female blacksmiths in 
Wisconsin, one each in the three metropolitan areas of Madison, Milwaukee, 
and Racine, and the rest scattered throughout the state. While these are ex- 
treme examples, this kind of result, I think, is not statistics in the usual sense 
of aggregating observations to simplify analysis and understanding. 

This problem must also be examined and solutions found. Considering the 
biases in the data, the cost, and the fact that these tabulations are among the 
last to be produced in almost all countries, statisticians may need to inquire 
into the specific purposes to be achieved by detailed census data on economic 
characteristics; into whether or not the classifications should be greatly short- 
ened or perhaps combined with something like a class of worker grouping in a 
new form; into whether occupational details should be collected at all by a com- 
prehensive census but rather by an associated sample survey which would have 
implications about the feasibility of small area tabulations and the use of ex- 
tremely detailed occupational groups. While statisticians from several countries 
have told me that they have no trouble with the occupational classification 
from an operational standpoint, this does not entirely settle the basic question 
of purpose and use. Have statisticians had sufficient regard for the transition 
from a largely handicraft to a largely machine age during the past century? 

Having mentioned two problems which, in the oldest statistical field of all, 
need re-examination, let me briefly touch on a new problem with which the 
conventional statistical apparatus does not deal directly and effectively. This 
is the question of measuring, nationally and upon an internationally compar- 
able basis, what is now called levels (or conditions) of living. The traditional 
family budget or expenditure studies do not embrace the concept sought here 
although they have their place in the galaxy of studies which surround the cen- 
tral concept. What is wanted is a measure or measures which will somehow 
describe and summarize a person’s position in the economic and social com- 
munity in which he lives—his housing, his health, his education, his nutrition; 
in short his command over all economic and social benefits, how they change in 
time and how they compare with benefits enjoyed in other countries. This, in- 
deed, is a difficult problem and it is also an important one since one of the main 
purposes of modern organized societies is to adopt policies designed to support 
improved economic and social conditions for their citizens. The United Nations, 
the International Labour Office, and other specialized agencies associated with 
the United Nations, have given considerable attention to statistical techniques 
which might be expected to produce the desired results. Without going into 
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details, one proposal involves the selection of “indicators”—such as expectation 
of life at birth, food supplies in terms of calories per capita, proportion of chil- 
dren five to fourteen years of age attending school and proportion of economi- 
cally active population unemployed. These are largely elements plucked out of 
existing statistical systems of the conventional type. These measures obviously 
do not apply equally well to all countries or to all cultures, and there is no way 
of weighting them into any unitary index, though this is perhaps a minor ob- 
jection. There is also the suggestion that properly planned and comprehensive 
family living studies on a sample basis, repeated from time to time in each 
country, would provide the necessary information. But there are problems 
here too; aside from the cost, considerable technical work would need to be 
done to stabilize response error in time-to-time comparisons and to provide for 
place-to-place comparisons at least on a minimum basis. I do believe that 
reasonably adequate measures of levels of living can be achieved in time but 
they present many difficulties of conception and operation that are far from 
being overcome. ' 
In attempting a rather concise outlook report on the world-wide statistical 
situation I have so far dealt briefly with the present position (statistics gen- 
erally are not adequate), with the general dimensions of the tasks that lie 
ahead (assuming rather rapid rates of change), and with a few problems of con- 
tent and method which may illustrate the need in a changing and developing 
world for a re-examination of conventional concepts in some subjects of eco- 
nomic and social statistics. Under these circumstances, how well equipped are 
we nationally and internationally to meet the challenge offered? In this respect 
there are entries on both sides of the accounts. On the debit side there still re- 
mains a desperate need for training and education in statistics, most particu- 
larly in the under-developed countries. Training is required at all levels from 
the mathematical statistician, who can work out suitable statistical methods 
for practical problems, to the full cast of specialists, clerks and administrators. 
In under-developed countries more emphasis must be given to sampling than 
to the older and more conventional methods if adequate statistics are to be 
available for general policy guidance. Secondly, in most countries of the world 
the salary levels and tenure of statisticians in the national services are very 
unsatisfactory. The comparatively low salary scales result in high turnover 
rates. Statisticians who have received advance training at home or abroad are 
likely to be lured to higher salaried posts elsewhere in government or in busi- 
ness. The lack of tenure in many national services means that the systematic 
development of statistical programs and of training operations is seriously im- 
paired. While these are long-term problems and depend for their solution upon 
general administrative improvements in governmental organization, it is inter- 
esting to note that a few countries, with assistance from international statis- 
tical agencies, have managed to establish a statistician class in their civil serv- 
ices which is at least as well paid as other professional groups. Thirdly, all 
national services suffer, to a greater or lesser extent, from inadequate and un- 
certain appropriations. This is a familiar problem and one which in the end re- 
sults in waste and extravagance. Relatively expensive programs such as cen- 
suses are sometimes never completed so that countries are deprived of the con- 
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tributions these statistics could make to the conduct of their affairs. Here again 
solutions must rest upon the gradual development of more stable and more 
mature government administration. Although we cannot really measure the 
value of statistics to a country we can confidently assert that there is a very 
large discrepancy between cost and value, as governments and other institu- 
tions are coming to realize. 

There are, however, a number of encouraging entries on the credit side of the 
accounts. Since the end of World War II the statistical establishments in a 
large number of countries have been greatly strengthened and in others na- 
tional statistical services have been set up for the first time. In most countries, 
the statistical programs have been overhauled to satisfy more nearly the re- 
quirements placed upon them. In these activities the international inter-govern- 
mental agencies have played a considerable role, mostly by giving advice and 
assistance to requesting countries in organizing, or reorganizing, the national 
services, and in providing expert guidance in re-shaping and modernizing the 
statistical programs. Since 1948, for example, the United Nations, the Inter- 
national Labour Office, the Food and Agriculture Organization, the World 
Health Organization, and the other specialized agencies of the United Nations 
family have supplied more than 200 experts to 52 countries for periods running 
from a few months to two or three years. This is small in relation to the require- 
ments, but funds are limited. Even more limited, however, is the supply of ad- 
visors with suitable experience available for assignment. Many different factors 
account for this and the statistician who says “have slide-rule, will travel” is 
most eagerly sought. . 

The international organizations are also tackling the training and education 
problem referred to above. This is being done in several ways. The statistical 
advisors are generally expected to organize in-service training courses and to 
help select qualified candidates for more advanced training abroad. This is 
normally in the form of fellowships and scholarships extending for periods of six 
months to two years. Since 1948 more than 450 statisticians from 76 countries 
have received advanced training under granis from the international agencies. 
In addition an average of 30 new students have enrolled in each of the ten 
terms of the International Statistical Education Centre at Calcutta, which is 
carried on under the auspices of the International Statistical Institute with the 
very generous assistance of the Government of India. A similarly sponsored 
institution at Beirut, financed by the Government of Lebanon, private grants, 
and contributions from United Nations Educational, Scientific and Cultural 
Organization has served 141 students during four terms. The Inter-American 
Statistical Institute has established a permanent center at Santiago and spon- 
sored three shorter term courses elsewhere in Latin America. The Statistical 
Training Center at the University of the Philippines, established by the United 
Nations in 1954, has had an average of 80 students during each of the six se- 
mesters. All these activities have a considerable impact on the level of statistical 
competence but there are often discouraging setbacks. I heard recently that 
two young statisticians who spent a year at one of the best European training 
institutes were persuaded upon their return home to join a different ministry 
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in their government than the one responsible for statistics. In that country 
there is only one trained statistician actually working in statistics. 

Apart from these more or less permanent centers and the system of fellow- 
ship awards, the international agencies have arranged for a large number of ad 
hoc training centers or seminars. These operate for periods of a few weeks to 
three or four months and have taken up a variety of subjects in the fields of 
agriculture, industrial production, labor, population, vital and health statistics, 
trade and balance of payments, statistical organization and statistical methods, 
particularly sampling. In all since 1948 there have been 45 such centers or 
seminars, attended by 1600 participants from more than 100 countries. Many 
of the centers have been sponsored jointly by two or more international agen- 
cies in co-operation with the host governments. 

I should note that the international agencies are not alone in giving statistical 
assistance and advice and in providing facilities for education and training in 
statistics. Several national governments, particularly the United States and 
the countries participating in the Colombo Plan such as Canada, are carrying 
on extensive programs in close co-ordination with the international activities, 
and many other governments, particularly in Western Europe, are providing 
excellent training facilities for foreign students. 

Associated with the educational enterprises and the training centers in the 
effort to improve the general level of statistics throughout the world is the 
preparation of technical guides or manuals which set out, step by step, con- 
cepts and methods involved in particular applications. These “do it yourself” 
handbooks are invaluable materials either for organized training courses or for 
use by working statisticians in national offices. A number of manuals on sub- 
jects in the economic and social fields, dealing both with substance and with 
methodology, have already been issued by the international agencies and sev- 
eral more are in preparation. These represent a unique literature never before 
available. They deal mainly with the practical problems of applied statistics 
which are likely to be encountered by the statistician of new and under- 
developed countries. They are based, as far as practicable, upon the uniform 
definitions, classifications and methods developed by the international 
organizations. 

The development of uniform statistical standards recommended for use by 
national statistical services owes much to the earlier work of the International 
Statistical Congress, the International Statistical Institute, and, after World 
War I, to the League of Nations, the International Labour Office, and the 
International Institute of Agriculture. Development of new standards and the 
revamping of older standards have been fairly rapid since the end of World 
War II. The present standards have been worked out upon a basis of broad 
agreement with national statisticians sc that they represent expert opinion as 
well as a synthesis of views upon problems and methods of practical application. 
This procedure has resulted in a much wider acceptance and use of the recom- 
mended proposals than obtained in earlier days. In general—there are a few 
exceptions—recommendations of international organizations are not manda- 
tory upon governments. Acceptance by a country must depend upon whether 
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the proposals are technically sound and conform with national requirements. 
It was the desire on the part of some members of the International Statistical 
Congress to impose recommendations upon governments, that wrecked the 
Congress in 1879 when Bismarck withdrew German participation and other 
countries took similar action thereafter. 

One more word about present arrangements for the development and im- 
provement of international statistics. A number of forums exist for the discus- 
sion of statistical problems. The Statistical Commission of the United Nations, 
composed of statisticians from fifteen countries, meets once every two years to 
review the statistical position generally and to act upon recommendations for 
standards on subjects within the competence of the United Nations itself and 
to lend general support to the related programs of the specialized agencies. 
Some of the specialized agencies have statistical committees, composed of 
national statisticians, which meet from time to time. The International Con- 
ference of Labour Statisticians, which held its first meeting in 1923, meets 
periodically. The International Statistical Institute, while now concentrating 
its attention mainly on aspects of education and training rather than on the 
promulgation of recommendations, exerts a powerful stimulus to the develop- 
ment of statistics generally. 

Apart from these world-wide institutions there are now several regional or- 
ganizations which discuss, and make recommendations concerning, regional 
problems and the adaptation of international recommendations to regional 
conditions. The work of the Inter-American Statistical Institute, organized in 
1940, is well known to you. To some extent the Institute has an independent 
program but in other respects it acts as a regional forum for discussion of ques- 
tions referred to it by the United Nations and the specialized agencies. The 
United Nations has organized, in co-operation with its regional economic com- 
missions, the Conference of European Statisticians (1953) and the Conference 
of Asian Statisticians (1957). These groups are composed of the heads of the 
national statistical offices and by means of plenary meetings and ad hoc working 
groups on particular subjects bring regional viewpoints to bear on world-wide 
recommendations. : 

In conclusion, after this brief review of existing arrangements to improve 
statistics throughout the world, I should like to add that the enthusiastic and 
fruitful collaboration of national governments in international statistical work 
has been the most important force in the accomplishments so far. And this, of 
course, is the main reason why one may reasonably expect that the work that 
lies ahead—in filling gaps, in re-examining objectives and programs, in devising 
new methods for new and expanding problems and in training the present and 
future generations of statisticians—will be carried on with enthusiasm and suc- 
cess. The outlook is good. 





THE CONTRIBUTIONS OF KARL PEARSON* 


Heven M. WALKER 
Columbia University 


NE hundred years ago last March there was born in London the man who 

has been called “the founder of the science of statistics.” It is most fitting 
that the statisticians of America should gather to do honor to one whose work 
has so profoundly advanced the development of our field. In fact without the 
work of Karl Pearson or someone else of similar convictions about the empirical 
and statistical approach to knowledge, similar breadth of interests, similar 
energy— intellectual and physical—and enough confidence in his own mental 
processes to carry him over all obstacles, few of the people in this room would 
find themselves in any statistical occupation. 

Scientific knowledge is cumulative through time. Therefore most scientific 
theories carry within themselves the germs of their own obsolescence. A pioneer 
in an uncultivated field not only must create new instruments, discover new 
facts, and invent new theory to explain these facts, but all too often he must 
also contend with apathy, opposition, and even hostility from entrenched in- 
terests threatened by the new theories or new methods. What he has learned 
he imparts to his students and to other young scientists who can begin where 
he leaves off, use his instruments and improve on them, enlarge the scope of his 
work, improve on his theory, perhaps even prove much of it false. They can 
stand on the platform he had to build. These younger scientists may not be 
more able than the pioneer but inevitably some of them will do work which 
supersedes his. The contributions of a scientist should not be evaluated chiefly 
in terms of how much of his theory and method are still considered to be the 
best theory and method known at some later date but should be evaluated in 
terms of how far the field was advanced by his life and work. Almost every 
high school graduate today knows more about physics than Archimedes knew 
and some of them will make discoveries Archimedes could not have made, but 
no one would for that reason question the world’s debt to Archimedes. In 
weighing the contribution of a scientist, then, we should ask such questions as: 
What was the state of the science when he began his reaserch? What new ele- 
ments did he introduce? How did his life work stimulate others? Did he in any 
serious way impede the work of others? What subsequent developments would 
have been impossible or improbable without his work? 

Modern statistical theory and method have developed in four great waves 
of ideas, in four periods, each of which has a very definite beginning but no 
ending. Some of the ideas developed in each period have what at present looks 





* A paper read in Atlantic City, September 11, 1957, at a joint meeting of the American Statistical A iati 
and the Institute of Mathematical Statistics. The chief source of biographical information was the 170 page memoir 
by Egon S. Pearson, Karl Pearson, An Appreciation of Some Aspects of His Life and Works, Cambridge University 
Press, 1938. Quotation has also been made from a paper by 8. 8, Wilks, “Karl Pearson: Fouider of the Science of 
Statistics,” The Scientific Monthly, 53 (1941), 249-53 aad irom a Centenary Lecture by J. B. s. Haldane delivered 
at University College, London, on May 13, 1957 entitled “Karl P , 1857-1957,” the text of which will be 
published in Biometrika in the near future. 
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like timeless validity. Other ideas and techniques have been superseded by 
more inclusive or more basic ideas and more effective techniques. 

The first great wave of modern statistical activity and thought was inaugu- 
rated by the publication of Francis Galton’s Natural Inheritance in 1889, by 
Karl Pearson’s first course of four lectures on “The Scope and Concepts of 
Modern Science” in 1391 and by his great series of “Mathematical Contribu- 
tions to the Theory of Evolution” which began in 1893. This period was marked 
by (a) the new conviction that the analysis of statistical data might provide 
the answers for a vast variety of importation questions regarding the life of 
plants, animals and men, (b) the collection of huge aggregates of data in 
quantity and diversity undreamed of before 1890, (c) the invention of the cor- 
relation coefficient and a variety of other methods for measuring association, 
(d) the invention of chi-square, (e) the proliferation of new statistics together 
with a strong emphasis on probable error formulas, (f) the attempt to char- 
acterize all frequency distributions by variants of a single differential equation, 
(g) interest in the variation among the individuals of a species in contrast to 
the earlier interest in variation among errors of observation, and (h) the con- 
struction of tables needed by the statistician and the biometrician. 

While chronologically the first studies in factor analysis belong in this period, 
that subject has always remained somewhat apart from the main stream of 
statistical thought, developing independently without much relation to the four 
waves of statistical activity to which I have referred. 

The second period began in 1915 when R. A. Fisher published his first paper 
on the exact distribution of the correlation coefficient. This period was marked 
by (a) the development of methods appropriate for use with small samples, (b) 
the search for exact sampling distributions as contrasted with tlie earlier general 
dependence on asymptotic probable error formulas, (c) the formulation of 
logical principles for testing hypotheses and for the design of experiments, (d) 
the invention of the technique of analysis of variance, (e) the development of 
criteria for choice among statistics used as estimators and the consequent slow 
disappearance of some previously popular statistics, (f) the extension of multi- 
variate analysis beyond the already familiar multiple correlation, (g) the clari- 
fication of the notion of degrees of freedom, and (h) the introduction of the 
concept of fiducial probability. 

The third period began about 1928 with the publication of certain joint 
papers by Neyman and Egon Pearson. The number of persons competent to 
develop new statistical theory now increased greatly and the important litera- 
ture began a rapid expansion. The logic of statistical inference was considerably 
changed in this period by the introduction of such ideas as (a) the second kind 
of error and the power of a statistical test, (b) the confidence interval as con- 
trasted with the symmetrical interval defined by a statistic plus or minus a 
multiple of its standard error used by Karl Pearson or the interval defined by 
the method of fiducial probability used by Fisher, (c) theories and techniques 
related to the selection of samples, especially for the purpose of surveys as 
contrasted with work on the design of experiments which was already well 
advanced, (d) the widespread application of statistical methods to quality con- 
trol in industry, (e) a new enthusiasm for nonparametric methods. 
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Apparently we are now moving into a fourth period the lines of which are 
not yet wholly clear. It could well be argued that sequential analysis belongs in 
this period because it was the contribution of Wald whose creative genius set 
off the reformulation of concepts which is now under way. However I prefer to 
place it in the third period along with other matters related to the Neyman- 
Pearson theory on testing hypotheses. The basic ideas of decision theory which 
may have a profound effect on statistical inference were developed by Wald 
shortly before his untimely death in 1950. 

Antedating these four great movements and preparing the way for them 
had been a long period of slowly increasing interest in the statistical way of 
thinking. In 1890 this interest was still sporadic, shared by so few people that 
it would be possible to mention most of them by name. In 1890 interest in 
statistical matters was restricted in scope and exhibited itself primarily in the 
collection of such public statistics as population data, vital statistics, and eco- 
nomic data; in actuarial work; and also in the adjustment of observations in 
astronomy and meteorology. Outside these areas, the movement was hampered 
not only by lack of interest but also by paucity of data and by the lack of 
adequate theory. Statistical theory was almost entirely that which had been 
developed by the great astronomers and mathematicians concerned with 
mathematical probability as related to errors of observations. Aside from cer- 
tain work by Poisson and Lexis, this theory related only to the binomial or the 
normal distribution. Aside from a very few persons whose work attracted little 
attention, theory related only to the distribution of a single variable and took 
no account of the relation between two variables. 

We need not concern ourselves now with developments in the later periods 
I have mentioned. Rather, I propose to sketch in more detail the state of sta- 
tistical theory and method in 1890 and its subsequent rapid burgeoning under 
the titanic impact of Karl Pearson. But first we should have an introduction 
to the man himself. What was the background of the man who introduced a 
whole new approach to what we now call the behavioral sciences? 

An incident from Pearson’s infancy which Julia Bell once related to me con- 
tains in miniature many of the characteristics which marked his later life. She 
had asked him what was the first thing he could remember. I cannot give his 
reply verbatim, but only my recollection of Julia Bell’s recollection of Pearson’s 
recollection. “Well,” he said “I do not know how old I was, but I was sitting 
in a high chair and I was sucking my thumh. Someone told me to stop sucking 
it and said that unless I did so the thumb would wither away. I put my two 
thumbs together and looked at them a long time. ‘They look alike to me’ I 
said to myself, ‘I can’t see that the thumb I suck is any smaller than the other. 
I wonder if she could be lying to me.’ ”’ Here in this simple anecdote we have 
rejection of constituted authority, appeal to empirical evidence, faith in his 
own interpretation of the meaning of observed data, and finally imputation of 
moral obliquity to a person whose judgment differed from his own. These char- 
acteristics were prominent throughout his entire career. 

Pearson’s father was a barrister named William Pearson, a Queen’s Council- 
lor and leader in the chancery courts. Karl remembered his father as a man of 
exceptional power f.. work, great energy, and a keen interest in historical re- 
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search, traits which the son also exhibited. Of his father Kari once said “During 
the legal terms, winter and summer, he was up at 4 a.m. to read his briefs and 
prepare his speeches for court. Home at 7 p.m., dinner followed, and bed at 
9 p.m. Only in the vacations did we really see him; then he was shooting, fishing, 
sailing with a like energy which astonished me even as an active boy.” 

In Karl’s educational history as described by his son Egon Pearson there are 
indications of phenomenal range of interests, unusual intellectual vigor, delight 
in controversy, determination to resist what he considered misdirected author- 
ity, and urge to self-expression, but there is almost no suggestion of any special 
leaning toward those studies for which he is now chiefly remembered. At the 
age of 18 he obtained a scholarship at King’s College, Cambridge, being second 
on the list of applicants, and 4 years later took his degree with honors in mathe- 
matics. Among his interests at this time, besides mathematics, were poetry, 
philosophy, the theory of elasticity, the history of religion, and a search for a 
concept of the Deity which he could find consistent with what he knew of 
science. He had already challenged the college authorities by his refusal to at- 
tend compulsory divinity lectures, and after a hard fight—which he undoubt- 
edly enjoyed—actually carried his point. Two years after taking his degree at 
Cambridge he had read law and been admitted to the bar. 

There followed travel in Germany and a period of study in the Universities 
of Heidelberg and Berlin where he balanced the study of physics with that of 
metaphysics, of Roman law with the history of the Reformation, of German 
folklore with socialism and Darwinism. In England he was soon lecturing and 
writing on German social life and thought, Martin Luther, Karl Marx, Mai- 
monides and Spinoza; contributing hymns to the Socialist Song Book, and 
showing himself “a young man, clever, self-confident, with an unusually clear 
grasp of his subject, who, where he . . . [disagreed] was often too eager to rush 
in and try to set the world aright.” His first publication at age 23 was a little 
book written in the form of letters from a man named Arthur to his fiancee. I 
shall quote a brief passage from what must have been largely autobiographical. 

“I rush from science to philosophy, and from philosophy to our old friends the poets; 
and then overwearied by too much idealism, I fancy I become practical in returning 
to science. Have you ever attempted to conceive all there is in the world worth 
knowing—that not one subject in the universe is unworthy of study? The giants of 
literature, the mysteries of many-dimensional space, the attempts of Boltzmann and 
Crooks to penetrate nature’s very laboratory, the Kantian theory of the universe, 
and the latest discoveries in embryology, with their wonderful tales of the develop- 
ment of life—what an immensity beyond our grasp! Mankind seems on the verge of a 
new and glorious discovery. What Newton did to simplify the various planetary 
notions must now be done to unite in one whole the various isolated theories of mathe- 
matical physics.” 


All that young writer wanted was a complete understanding of the universe. 
Looking forward thirty years to the first issue of Biometrika we are reminded 
of the motto on its frontispiece, under a picture of a statue of Charles Darwin: 
“Ignoramus, in hoc signo laboremus.” Those five words ring out like the basic 
theme of Karl Pearson’s life, “We are ignorant, so let us work.” 

In 1884, when he was 27 years old, Pearson was appointed to the chair of 
Applied Mathematics and Mechanics ‘at University College, London. For the 


SRE. RAR 5 Coe 














THE CONTRIBUTIONS OF KARL PEARSON 15 


next 6 or 7 years he was occupied with teaching mathematics to engineering 
students, writing papers on elasticity, completing Isaac Todhunter’s unfinished 
book A History of the Theory of Elasticity and the Strength of Materials from 
Galilei to the Present Time, lecturing on socialism and free thought, publishing 
a book on Ethics of Freethought, writing in German (Die Fronica) a historical 
study of the Veronica legends concerning pictures of Christ, collecting material 
on the German passion play which later formed the substance of The Chances 
of Death, completing a book called The Common Sense of the Exact Sciences 
which had been begun by his predecessor W. K. Clifford and taking an active 
part in a small club of men whose avowed purpose was to break down the 
conventional barriers which prevented free discussion of the relations between 
men and women. J. B. 8. Haldane comments on the Pearson of the year 1890 
thus: 


“If, in 1890, one had had to pass judgment on Pearson, it might have run as follows, 
‘He is a first rate teacher of applied mathematics. He is somewhat of a radical, but 
he is only 33 years old. He will settle down as a respectable and useful member of 
society, and may expect a knighthood if he survives to 60. He will never produce 
work of great originality, but the college need not be sorry to have appointed him.’” 


In 1890 this appraisal of Pearson’s future would have been realistic, but he 
was then approaching a dramatic change in his professional life. 

In 1890 Pearson was appointed Lecturer on Geometry at Gresham College 
with freedom to choose the subject matter on which he would lecture. In March 
of the following year he delivered his first course of four lectures on “The Scope 
and Concepts of Modern Science.” In 1892 the material of these and other 
lectures, revised and enlarged, was published as the first edition of that amazing 
book, The Grammar of Science, and in 1893, he wrote the memoir on “Asym- 
meirical Frequency Curves.” It is apparent that a very important change has 
taken place in his concepts of scientific method, that he has reached a new con- 
viction about the statistical aspects of the foundations of knowledge, and that 
for him problems of heredity and evolution have acquired a new urgency. 
Why? 

In 1890, W. F. R. Weldon was appointed to the chair of biology at University 
College. Weldon was already acquainted with Francis Galton and already en- 
gaged in statistical research. In 1890 Weldon published one paper on variations 
in shrimps, in 1892 a second paper on correlated variations, and in 1893 a 
third paper which contains the sentence “It cannot be too strongly urged that 
the problem of animal evolution is essentially a statistical problem.” In 1893 
that point of view was heresy. The scientific importance of the intense personal 
friendship which soon sprang up between Pearson and Weldon, then both in 
their early thirties, can scarcely be exaggerated. Weldon asked the right ques- 
tions and they drove Pearson to some of his most significant contributions. 
Weldon’s sudden death from pneumonia at the age of 46 was “a heavy blow to 
science and to Pearson the greatest personal loss of his life.” 

In 1889 Francis Galton, then 67 years old, published Natural Inheritance, 
and this book unquestionably marked the beginning of a new epoch in the 
history of science. Its publication induced Weldon, Edgeworth, and Pearson to 
study correlation and to recognize its immense importance for many new fields 
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of inquiry. In his “Notes on Correlation” Pearson later wrote, “So far as I can 
remember what happened at all, it was as follows. I know that I was immensely 
excited by Galton’s book of 1889 . . . and that I read a paper on it in the year 
of its appearance. In 1891-92 I lectured popularly on probability in Gresham 
College, taking skew whist contours as illustrations of correlation. In 1892 I 
lectured on variation, in 1893 on correlation to re. arch students at University 
College, the material being afterwards published as the first four of my Philo- 
sophical Transactions on evolution. At this time I dealt with correlation and 
worked out the general theory for three, four, and ultimately n variables. The 
field was very wide, and I was far too excited to stop to investigate properly 
what other people had done. I wanted to reach new results and employ them.” 
Pearson did not become personally acquainted with Galton until the latter was 
70, which would be in 1892. It is likely that Weldon introduced them, although 
I can find no documentation for that statement. The stimulation Pearson re- 
ceived from Galton and the devotion he felt toward the older man show on 
every page of the four volumes of The Life, Letters and Labours of Francis 
Galton, which is one of the world’s great biographies. On this work of over 1300 
quarto pages with about 170 full page plates, Pearson lavished some 20 years 
of work and much of his personal fortune. 

Having established the year 1890 as marking a turning point in Pearson’s 
scientific career, let us have a look at the status of statistical work in that year 
and at the changes brought about in the next few years by Pearson and his 
associates. 

We might begin with the matter of data. Surrounded as we are now by vast 
compilations of data, it is difficult for most statisticians to believe there was a 
period not long remote in which the verification of statistical hypotheses was 
hampered by the lack of observed data. 

The gathering of public statistics by governments and semipublic agencies 
was well established. Since about 1800 most of the countries then important 
had been taking official censuses. Data were available on population, immigra- 
tion and emigration, births, deaths and marriages, crimes, suicides, imports and 
exports, agriculture, forestry, land ownership, price and supply of commodities, 
insurance, temperature and barometric variation, governmental expenditures, 
and the like. Several nongovernmental societies had been set up chiefly for the 
purpose of improving the quality of public statistics, as, e.g., the Statistical 
Society of London in 1834 and the American Statistical Society in 1839. 

Insurance of various kinds has an ancient history, the first English statute 
on that subject having been passed in 1601. As early as 1838 Augustus de 
Morgan had written an Essay on Probabilities and their Application to Life 
Contingencies and Insurance Offices. Actuarial work had become a fairly well 
developed and respected profession. 

The 25 years from 1853 to 1878 was the era of the great international sta- 
tistical congresses. Economic statistics moved ahead greatly in this period, with 
notable improvements in methods of gathering data. 

Governments were beginning to take physical measurements on their soldiers 
and these data could be obtained by anthropometrists. The great astronomers 
of the first half of the 19th century not only developed a great deal of mathe- 
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matical theory regarding probability and the adjustment of observations, but 
they also recorded and sometimes published astronomical observations, records 
of temperature and barometric pressure and the like. 

Outside these areas, however, there were few data readily available and few 
persons had yet attempted to gather data for the solution of a specific problem. 
When Daniel Bernoulli wrote on the advantages of inoculation as a preventive 
of smallpox and when Poisson wrote on the application of probability theory 
to the credibility of testimony, they had no empirical data, only a priori 
assumptions. Two men, Henry Pickering Bowditch of the United States, in 
1877, and Hermann Ebbinghaus of Germany, in 1885, each gathered extensive 
data for the solution of a particular problem, but neither one made a second 
statistical study nor took steps to promote the statistical approach to practical 
research problems in general. Bowditch, who was a professor in Harvard Uni- 
versity and president of the Boston school board, wanted to improve school 
practice with respect to growing children, so he made records of 24,500 children 
in respect to age, height, weight, nationality, and parental occupation. In the 
analysis of his data he narrowly missed discovering regression, if not correla- 
tion. But he did miss, and so far as I know he influenced no one else to make 
similar studies. Ebbinghaus in his study of memory made what was probably 
the first statistical study in the field of experimental psychology. 

Most of Pearson’s predecessors were either pure mathematicians who wrote 
about the applications of probability theory to human life without ever staining 
the pages of their treatises with concrete data, or they were men who worked 
in statistical agencies tabulating data without much knowledge of mathe- 
matical analysis. Notable exceptions were the versatile Adolphe Quetelet of 
Belgium, the insightful Wilhelm Lexis of Germany, the lucid F. Y. Edgeworth 
and the many-talented Francis Galton of England. These men put high value 
on concrete data but the amount of data they had access to was paltry beside 
what soon began to be collected by Pearson and his associates. Always Pearson 
insisted on publishing the data as well as the derived statistics. When Pearson 
developed a new piece of statistical theory his purpose was to find a better way 
of discovering something about life and he immediately used his theory on data. 
His mathematics might be cumbersome but he was not primarily performing a 
mathematical exercise. He was developing a methodology for the exploration 
of life. , 

Before Pearson there were many writers who dealt with measures of variation 
among errors made in the observations of a single value. The concept of true 
variation among individuals is quite a different idea. It was familiar to Galton, 
Lexis, Edgeworth and Quetelet. This idea of individual variation was studied 
extensively by Pearson and is one of his genuine contributions to science. 

The very language of statistical writing is still replete with terms coined by 
Pearson: array, bi-serial r, chi-square, coefficient of contingency, coefficient of 
variation, correlation ratio, kurtosis and its relatives leptokurtic, mesokurtic, 
and platykurtic, mean square contingency, moment coefficient, multiple 
correlation, partial correlation, normal curve, product moment, scedasticity 
and its relatives homoscedasticity and heteroscedasticity, standard deviation, 
and sigma as symbol for the standard deviation. 
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One of the problems on which Pearson made a major investment of time and 
energy was that of deriving a generalized frequency curve based on a single 
differential equation, with parameters obtained by the method of moments. If 
his elaborate and extremely interesting system is seldom used today and if the 
fitting of curves to distributions of observed data no longer holds a prominent 
place among statistical techniques, nevertheless in its day it served a useful 
purpose. At the very least it was a corrective for the almost rapturous over- 
statements of Quetelet as to the universal applications of the normal law to 
social phenomena. 

Having fitted a curve to a set of observations Pearson needed a criterion as 
to how good the fit was, and so he invented chi-square. Quetelet and others 
who wanted to demonstrate the closeness of agreement between the frequencies 
in a distribution of observed data and frequencies calculated on the assumption 
of normal probability merely printed the two series side by side and said in 
essence, “Behold!” They had no measure of discrepancy and so far as I know 
they were not made uncomfortable by the lack of such a measure. Pearson not 
only devised the measure but he worked out its distribution and had it calcu- 
lated. His understanding of the number of degrees of freedom appropriate to 
its use is not in accord with present day concepts and as late as 1922 he was 
still contending vigorously that for p categories the number of degrees of free- 
dom must always be p—1. Chi-square has turned out to be an enormously 
useful device with a range of applications far greater than the specific problem 
for which it was created. 

If asked what was Pearson’s chief contribution to statistical techniques, 
many persons would promptly say “Correlation. The coefficient of correlation 
is called the Pearson r because he discovered it.” They would be both right and 
wrong. The idea of correlation is due to Francis Galton who published a paper 
on “Co-relations and their measurement chiefly from anthropometric data” 
in 1880 and another on “Regression toward mediocrity in hereditary stature” 
in 1885, and who gave a more widely read statement in his book on Natural 
Inheritance in 1889. Before Galton, several mathematical astronomers had 
worked out the mathematical equation of the bivariate normal distribution, 
among them Robert Adrain in 1808 for independent variables, Giovanni Plana 
in 1812, Gauss in 1823, and Bravais in 1846 for related variables. None of them, 
however, was seeking a measure of relationship and none of them called atten- 
tion to the product term in the exponent as having special significance. Never- 
theless Spearman, who did not like Pearson, referred to the correlation coeffi- 
cient as “Bravais’ function.” The difference between the goal of the mathe- 
matical writer of probability and that of the statistician seeking answers to 
practical problems is made strikingly clear by an examination of Emanuel 
Czuber’s Theorie der Beobachtungsfehler, published two years after the appear- 
ance of Galton’s Natural Inheritance. In Part III Czuber discusses errors in a 
plane and in space, and extends Bravais’ work to the normal correlation surface 
in three dimensions. He does not mention Galton, or exhibit any interest in a 
measurement of relationship. In the entire book there are no data. 

Galton had been seriously hampered in his study of correlation both by lack 
of data and by lack of an efficient routine of computation. He had data on sweet 





THE CONTRIBUTIONS OF KARL PEARSON 19 


peas and on stature of parents and adult offspring for 200 families. Immediately 
Weldon began to make measurements on shrimps for correlation studies, Edge- 
worth wrote a paper on correlated averages, Bryan (in America) made a correla- 
tional study of nearly 800 school children, and Pearson began to lecture on 
correlation. 

In Pearson’s first fundamental paper on correlation entitled “Regression, 
heredity, and panmixia,” written in 1895 from his lecture notes, he generalized 
Galton’s conclusions and methods, derived the formula which we now call the 
“Pearson product moment” and two other equivalent formulas, gave a simple 
routine for computation which could be followed by a person without much 
mathematical training, stated the general theory of correlation for three vari- 
ables, gave the coefficients of the multiple regression equation in terms of the 
zero order correlation coefficients. . 

There followed a series of great memoirs on various aspects of correlation, 
some by students or associates of Pearson such as George Udny Yule and 
W. G. Shepherd, but most from his own hand. These dealt with such matters 
as correlation in nonnormal distributions, tetrachoric r, correlation between 
ranks, correlation when one or both variables is not scaled, when regression is 
not linear, and the like. There were many papers presenting the results of cor- 
relation analysis in a great variety of fields. And a large amount of labor went 
into the derivation of the probable error of each of these various coefficients and 
the tabulation of various probabilities related to correlation. It is a sound 
commentary to name the coefficient the “Pearson r.” 

If Pearson himself had been asked which part of his work would be of en- 
during value I surmise he might have mentioned, among other things, the 
derivation of the probable errors of what he called “frequency constants,” and 
various tables to facilitate the computation of such. Pearson introduced the 
term “standard deviation” and the symbol ¢ in that first great paper on asym- 
metrical frequency curves, and he used this term and symbol when discussing 
variation among individuals. When speaking of the sampling variability of a 
statistic I think he always used the probable error, never the standard error. 

His method was to write the equation for a statistic, take the differential 
of both sides of that equation, square, and sum, and reduce the result by any 
algebraic devices of which he could think. The process of reduction was often 
formidable. Even though he was not much concerned with the distinction be- 
tween a statistic and its parameter, and frequently used the former in place 
of the latter, these probable errors marked a great advance over the previous 
complete inability to measure the sampling variability of most statistics. In 
this era new statistics were being proposed on every side and the amount of 
energy which went into the derivation of these probable errors was tremendous. 
With the successful search for exact sampling distributions which has been 
under way ever since the publication of R. A. Fisher’s 1915 paper on the 
sampling distribution of the correlation coefficient, we now have better methods _ 
than most of these probable error formulas could provide. 

The first edition of Biometrika (1900-1901) referred editorially to the urgent 
need for tables to facilitate the work of the statistician and biometrician and 
promised that such tables would be produced as rapidly as possible. Such 
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tables as were then available were in widely scattered sources, some of them 
almost impossible to obtain. Readily available in 1900 were tables of the bi- 
nomial coefficients, the trigonometric functions, logs and antilogs. A large 
table of the logs of factorials had been computed in 1824 (C. F. Degen) but 
it was almost unknown. There were tables of squares, cubes, square roots, and 
reciprocals of which Barlow’s is the best known, and there were multiplication 
tables by Crelle and by Cotsworth. Legendre had published a table of logs of 
the gamma function but copies were very scarce. The normal probability 
function had been extensively tabulated but always with either the probable 
error (.6745c) or the modulus (c+/2) as argument, never the standard error. 
Poisson had not tabulated the distribution which bears his name but Bortke- 
wicz had done so in 1898. 

A list of the tables which have been printed in Biometrika from its second 
issue in 1902 until the present, or which have appeared in the separate volumes 
of Tables for Statisticians and Biometricians or in the Draper’s Company Series 
of Tracts for Computers would be a very long one. Some of these tables are no 
longer used, others appear to be timeless in value, even after the advent of the 
electronic computer. The Tables of the Incomplete Beta Function were Pearson’s 
final contribution to science, published when he was 78 years old. 

The young sciences of biometry and mathematical statistics were given 
great impetus by the founding of Biometrika in 1901 for now there was a 
periodical in which important memoirs on the behavioral sciences could be 
published even if those memoirs were mathematicel. The Biometrics Labora- 
tory and the Eugenics Laboratory of the University of London, which united 
in 1911 into the Department of Applied Statistics with Karl Pearson as its first 
Galton Professor, provided a center to which leading statisticians and promising 
young students came from all over the world. Its influence would be hard to 
exaggerate. 

The frequent controversies in which he was embroiled cannot be disregarded 
in any serious discussion of Pearson’s lifework. In fact I have met young statis- 
ticians who know little about his vast contributions but have heard the com- 
ment that he held back the development of the field by violent and disputatious 
resistance to new ideas. This is not the time to discuss the validity of the argu- 
ments in any of these conflicts, even if I felt competent to do so, but there are 
some general remarks which should be made. 

There is no question but that he was almost always engaged in a first-class 
dispute with someone and I suspect that he found these verbal battles stimu- 
lating. In some recollections of his Cambridge days published the last year of 
his life, he spoke of “pleasure in friendship, pleasure in fights, pleasure in the 
coaches’ teaching, and pleasure in searching for new lights as well in mathe- 
matics as in philosophy and religion.” I think we may assume that the fights 
were verbal and that if he had come off second-best he would not have recalled 
them with pleasure. 

Always he regarded himself, I think, as a sort of crusader, and among the 
qualities a crusader needs are superb self-confidence, the courage to fight for 
his convictions, and a touch of intellectual intolerance. In his youth, Pearson 
did battle for such unpopular radical ideas as socialism, the emancipation of 
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women, and the ethics of free thought. A few years later he was involved in a 
long struggle for the unpopular idea that mathematics should be applied to the 
study of biology. After one of the early skirmishes in that struggle he wrote 
to Galton, “I always succeed in creating hostility without getting others to see 
my views; infelicity of expression is I expect to blame. To you I mean to speak 
them out even at the risk of vexing you. ... All the problems laid down by 
you in your printed paper seem to me capable of solution, and nearly all of 
them in one way only, by statistical methods, and calculations of a more or less 
mathematical kind. The older school of biologists cannot be expected to ap- 
preciate these methods. . . . A newer generation is only just beginning its train- 
ing in them.” Much bitterness arose over this question and the Royal Society, 
while ready to accept papers dealing with eiiher mathematics or biology, refused 
to accept papers dealing with both. That refusal was one of the facts which led 
to the founding of Biometrika. 

Beginning in 1907 the Eugenics Laboratory published numerous very sub- 
stantial statistical memoirs on three of the most controversial social issues of 
the day, namely pulmonary tuberculosis, alcoholism, mental deficiency and 
insanity, and the conclusions reached by these studies challenged many cur- 
rent theories and practices of the medical profession. The titles of the two series 
under which these papers appeared were not exactly calculated to win friends 
and influence people, one being “Studies in National Deterioration” and the 
other “Questions of the Day and of the Fray.” The angry reaction of medical 
authorities and public officials was violent. 

These major prolonged struggles with organized groups were probably necos- 
sary to permit the young sciences of biometry and statistics to break the re- 
straining bonds of apathy, of ignorance, of intrenched authority. However 
Pearson’s many clashes with individual scientists did not serve the same pur- 
pose and may in some instances actually have impeded the advance of statis- 
tics. The acrimonious altercations must have diverted considerable energy from 
scientific pursuits. Sometimes the exchange of polemics reminds one of the con- 
flicts between the two elder Bernoullis or between the followers of Newton and 
the followers of Leibniz. These conflicts are easier to understand when we 
remember that Pearson was a perfectionist and also that he was trained for a 
legal career and from childhood had had in his father an example of a successful 
trial lawyer. Sometimes the verdict of history now seems to be on Pearson’s 
side of the argument, and sometimes on the other side. One must feel a certain 
regret that in his zeal for what he believed to be truth he did not recognize that 
fusion is more powerful than fission. 

If intellectually convinced of an error, Pearson was ready to admit it. He 
once published in Biometrika a paper called “Peccavimus,” “We have erred.” 
L. N. G. Filon, speaking at a dinner in honor of Pearson once said, “K. P. 
lectured to us on the mathematical theory of statistics, and on one occasion 
wrote down a certain integral as zero, which it should have been on an accepted 
general principle. Unfortunately I have always been one of those wrong- 
headed persons who refuse to accept the statements of professors, unless I can 
verify them for myself. After much labour, I actually arrived at the value of 
the integral directly—and it was nothing like zero. I took this result to K. P. 
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and then if I may say so, the fun began. The battle lasted, I think, about a 
week, but in the end I succeeded in convincing Professor Pearson. It was 
typical of K. P. that, the moment he was really convinced he saw the full con- 
sequences of the result, he proceeded at once to build up a new theory which 
involved scrapping some previously published results and generously associated 
me with him in the resulting paper.” 

I should like to quote also a sentence out of a letter to Pearson from Bateson 
at the beginning of their long controversy, “I respect you as an honest man, 
and perhaps the ablest and hardest worker I have met, and I am determined not 
to take up a quarrel with you if I can help it. I have thought for a long time 
that you are probably the only Englishman I know at this moment whose 
first thought is to get at the truth in these problems. .. . ” 

While trying to prepare this paper I have struggled with an almost paralyzing 
attack of procrastination. So many pencils were sharpened, so many unim- 
portant errands allowed to interrupt, so many wastebaskets filled with trivia, 
that I finally admitted the hesitancy in my writing hand was abnormal. No 
matter what I put on paper it seemed to merit the comment K. P. himself once 
made in Biometrika about a paper I had written on Giovanni Plana, “She has 
both overestimated and underestimated his contributions.” I was not helped 
by rereading a letter he wrote me while I was working on the history of statis- 
tics: “I do indeed admire your courage. . . . However you can attempt and often 
achieve big tasks on your side of the Atlantic which are not possible for us 
here.” 

Appraising the many facets of the work of such a man is not made easier by 
the fact that he is no longer able to issue a vigorous rejoiner, pointing out the 
inevitable omissions, exaggerations and mistakes in interpretation. Now my 
worst fears have been realized and I am vividly aware that I have dealt sketch- 
ily with some very large issues and have been able to create only a faint im- 
pression of the contributions of a man to whom no smaller word than “titan” 
is appropriate. 

Although Karl Pearson made contributions to statistical technique which 
now appear to be of enduring importance, these techniques are not the principal 
reason why we should celebrate the centenary of his birth. What he did in 
moving the scientific world from a state of sheer disinterest in statistical studies 
over to a situation in which a large number of well trained persons were eagerly 
at work developing new theory, gathering and analyzing statistical data from 
every content field, computing new tables, and reexamining the foundations 
of statistical philosophy, is an achievement of fantastic proportions. His con- 
cept of a general methodology underlying all science is one of his great contri- 
butions to the world. His laboratory was a world center in which men from all 
countries studied and from which they returned to set fires in their own 
homelands. 

Few men in all the history of science have stimulated so many other people 
to cultivate and to enlarge the fields they had planted. 














KARL PEARSON—AN APPRECIATION ON THE 
100TH ANNIVERSARY OF HIS BIRTH* 


Samve. A. STOUFFER 
Harvard University 


T WAS 26 years ago, as a student of sociology with a fresh Ph.D., that I met 

Karl Pearson. The Social Science Research Council had given me some time 

in England, to learn things which might help in applying statistics to sociology. 

I had the privilege of personal contact for several months with R. A. Fisher at 

Rothamsted, attended a seminar of A. L. Bowley at the London School of 

Economics, and for a year attended the lectures both of Karl Pearson and of 
his son Egon at University College. 

I wish I could communicate to you, and especially to those of you who are 
just now beginning your professional careers in a world of Statistics incredibly 
more sophisticated than that of Karl Pearson’s day, something of the thrill 
in meeting in person and studying under a man of Pearson’s immense reputa- 
tion. Author of the Grammar of Science; perfecter of simple linear correlation ; 
inventor of multiple and partial correlation, of curvilinear correlation, of tetra- 
choric and bi-serial correlation; discoverer of the x?-function for summarizing 
multinomial data with magnificent simplicity; builder of a beautiful system 
of frequency curves derived from a single differential equation which in turn 
harked back to the hypergeometric series; founder of Biometrika and author 
or co-author of a prolific literature applying these new statistics to biological 
and sociological data—Karl Pearson was a hero of Asgard to an Ameri- 
can boy vouchsafed a visit to the home of the gods. Indeed, Pearson was 
Thor himself—for the thunderbolts with which he attacked unsparingly those 
who dared oppose him still were echoing and reechoing. His battles to defend 
Galton’s Law of Ancestral Inheritance against the heresy of rediscovered 
Mendelism may have faded somewhat in the past, but his fulmination against 
those who questioned his faith in nature as against nurture and dared to chal- 
lenge voluminous correlational data which he and his co-workers had amassed 
were still audible, and the air was now electric with conflict as he stubbornly 
defended his statistical system against the threats of the Fisherian revolution. 

Pearson wrote with a goose quill pen, which he sometimes seemed to have 
dipped in ink mixed with acid. To give you just one example. In 1910, two 
economists from Cambridge, John Maynard Keynes and Alfred Marshall, 
challenged his interpretation of data on the influence of parental alcoholism on 
the physique and ability of offspring, the former in a paper in the Journal of 
the Royal Statistical Society and the latter in a letter to the Times. Pearson 
counterattacked, with data, reasoning, and arguments ad hominem, in Pam- 
phiet No. 1 of a series entitled Questions of the Day and of the Fray.' 





* A paper read in Atlantic City, September 11, 1957, at a joint meeting of the American Statistical Association 
and the Institute of Mathematical Statistics. 

1 Department of Applied Mathematics, University College, London. Questions of the Day and of the Fray, No. 
1. Supplement to the Memoir entitled “The Influence of Parental Alcoholism on the Physique and Ability of the 
Offspring.” A reply to the Cambridge Economists, by Karl Pearson. 
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“I fear that our Cambridge economists,” he wrote in his 26-page reply, “live in a 
world of their own, quite out of touch with the true social conditions of large sections 
of the British population. The conditions depicted may be deplorable, we were not 
discussing that subject, but the ignorance of their representative character would, we 
should imagine, even hamper the usefulness of an English economist.” 


And he drops this footnote: 


“That they should be used as an argument in a journal which claims to represent 
British statistical knowledge makes me gravely doubt the adequacy of the editorial 
management. This view is confirmed by Mr. Keynes’ treatment of anthropometric 
data.” 


Pearson was particularly irked by Marshall’s phrase referring to him as a 
“mathematical outsider.” 

“Some day,” he wrote, “Cambridge may awake to the fact that a school of modern 
statistics may help to raise to a still higher level its biology, its anthropology, its 
medicine, and even its economics . .. . Had the Cambridge economists devoted the 
last fifteen years to the development of an adequate taeory of statistics and the last 
ten largely to its application to man in his social relations, they would probably have 
less hostility and use slightly more restrained language when they find ‘mathematical 
outsiders’ venturing to attack problems for the solution of which those economists 
have neither themselves provided adequate data nor devised instruments by which 
they might handle effectively the material collected by others.” 


Pearson never became a member of the Royal Statistical Society and he 
attended only one meeting of the Royal Society after the bitter controversies 
over Mendelism at the turn of the century. 

What manner of man was this Thor? Of course, I do not pretend really to 
know. But never could one be more surprised than was the young American 
student of sociology who entered his classroom in 1931. Here was a warm, 
friendly, and above all gentle person. An old man now, 74, though he looked 
and acted many years younger. There was dedication, yes, to issues to which 
he was devoted, but his cause above all others was the direct and indirect de- 
velopment of scholars to master statistical theory and use it in practical applica- 
tions. His lectures were largely algebraic derivations and demonstrations of his 
fundamental statistical ideas—the system of frequency curves, for example— 
and were characterized always by lucidity and sometimes by eloquence. As I 
look back I remember the kindliness of his smile and can recall only two or three 
occasions when there were touches of bitterness or acidity. 

He was most considerate of his students. When I happened to suggest a 
problem one day in which I was interested, and showed him a partial solution, 
he later carried it to his class and devoted several lectures to a new algebraic 
development which he worked out over a week-end. He asked me to share in its 
publication, and I collected numerical illustrations, calculations x? to five sig- 
nificant figures on a Brunsviga. We went over the manuscript together at his 
country place in Surrey and took a long walk together through the countryside, 
with its view of the Downs. He walked briskly—and once even paused to ask 
smilingly if the going was too swift for me. The only note of asperity came when 
he adjured me not to mention the paper when at Rothamsted. There was no 
hint of dissatisfaction with my small help, but when the paper finally appeared 
in Biometrika, I noted somewhat wryly that he had recalculated all of my x”s 
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to eight significant figures. And he had probably done it himself, on his own 
Brunsviga. 

Because it was so hard for me to reconcile this sweet and gentle, if stubborn, 
personality with the writer of diatribes, I noted with special interest, in the 
Memoir by his son Egon,’ a letter written to Pearson by Henry Bradshaw, one 
of his earliest friends, way back in 1883. Pearson had become involved. ia a 
rather bitter controversy in connection with an exhibition at the British 
Museum. 

“You have such a fund of strong gentleness in you,” wrote Bradshaw, “that it is 
most cruel to see the immense power that this gentleness possesses simply wasted by 
& perversity as noxious as anything to be found even on the staff of the British 
Museum. I hope you will come on Tuesday, and you can sit on me to your heart’s 
content.” 


Pezrson was a prodigious and compulsive worker. I remember asking him 
once how he had time to write so much and compute so much. “You Americans 
would not understand,” he replied. “But I never answer a telephone or attend 
a committee meeting.” Indeed, there was no telephone in the Biometrics Labo- 
ratory, but the walls were crowded with engravings of portraits of men honored 
in the history of science and with mottoes which Pearson had culled from the 
treasury of the past. Tea was a relaxing interval, and a ceremonial occasion like 
the Galton dinner, at which the Professor annually offered a toast “In pious 
memory of Francis Galton,” followed by other toasts, was a ritc not to be 
forgotten. Just before offering a toast to postgraduate and student workers at 
one of the last Galton dinners before Karl Pearson’s retirement, I tremblingly 
spilled my claret all over the brand new boiled shirt I had bought for the dress 
occasion. Characteristic British courtesy of Mrs. Pearson, who was seated be- 
side me, somewhat relieved my embarrassment. Memorable as was the spirit of 
the Laboratory, the heating facilities left something to be desired. When the 
winter temperature dropped to the forties, I formed the habit of slipping out 
quietly and across town to the warmer London School of Economics and, 
incidentally, doing my computations on a Monroe calculator, which had cer- 
tain advantages over the Brunsviga, although they were not appreciated, may 
I say, by Karl Pearson. 

Others are better equipped than I to evaluate the impact of Karl Pearson on 
statistical theory and the significnace for today of the great number of sta- 
tistical tools which he forged. Without doubt he failed to appreciate the full 
implications of the small sample theory which was to revise, though often only 
in detail, some of the major structures of his invention. And still later develop- 
ments—as in stochastic processes, in decision theory, and in the consequences 
of the theory of games—perhaps owe relatively little to his influence. 

I shall speak only as to the impact of Pearson on sociology. And these re- 
marks must be brief. 

First, and most important, is the fact that the statistical tools he forged, in- 
cluding some which have today been superseded, made it possible for sociology, 
especially in America, to take great forward strides in the use of quantitative 
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empirical data. The influence was felt most strongly perhaps at Columbia, 
where students of Giddings like W. F. Ogburn, Frank A. Ross, F. Stuart 
Chapin, T. J. Woofter, and Stuart A. Rice pioneered in applying correlation 
and other Pearsonian concepts freshly to masses of data—especially Census 
data. Ogburn and later Ross edited the Journal of the American Statistical 
Association for many years. They in turn influenced a new generation of sociolo- 
gists, notably at Columbia, Chicago, and Minnesota, and one has only to com- 
pare the American Journal of Sociology or American Sociological Review of today 
with publications prior, say, to about 1925, to see how great a change has been 
wrought. 

Change continues, and more and more we are seeing reports in our sociologi- 
cal journals of research of a different kind—even involving controlled experi- 
ments. Dissatisfaction with correlation, as a half-way measure, is increasing 
and a more rigorous kind of proof is being demanded and sometimes supplied. 
And for this, new statistical tools are often needed and are being produced. 
Yet, even if we use different degrees of freedom in calculating x? than Pearson 
would have approved, the basic idea in this and many of our measures still is 
his. 

As to Pearson’s contribution to theoretical strategy in sociology, I fear at the 
moment that it is not easy to observe. His overweening faith in his correlation 
coefficients as demonstrating the unimportance of the environment as com- 
pared with heredity has a quaint, antiquarian ring. Two generations of cultural 
anthropology and of learning experiments in psychology have marshalled vast 
bodies of data attesting to the plasticity of the human organism. I know no 
sociologist who would explain criminality or propensity to criminality as an 
inherited trait, when a parsimonious and often demonstrable explanation of 
many, though not all, criminals is in terms of conformity to norms of a sub- 
culture. The concepts of role, and of reference systems which reward or punish 
behavior in various roles, now seem to be fruitful tools of inquiry. Pearson 
vigorously denied the existence of “pure” races, but his views on the inherent 
inferiority of the Negro or the American Indian find no respectable scientific 
support today. 

An essay like Social Problems: Their Treatment, Past, Present and Future, 
published in 1912,* which bears much on the future of sociology, contains an 
eloquent and discerning plea for measurement to replace essay writing and 
folklore, a plea which was well ahead of its time and even could bear re-reading 
today—along with an exposition of racial and eugenical doctrines which could 
hardly be more repellent to the contemporary sociologist. Yet, on this hun- 
dredth anniversary of Karl Pearson’s birth, I wonder whether it may not be 
worth a pause to ask ourselves why we are so sure he was hopelessly wrong. 

Let us examine a paper, which generated long and heated controversy, on 
the subject of tuberculosis. Pearson argued from his data that contemporary 
treatment of tuberculosis, including sanatoria and rest, was not reducing the 
death rate and would not. The basic problem, as he saw it—because almost all 
people got tuberculosis in their lifetimes but most recovered from it withous 
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overt symptoms—was that the majority had the inherited constitutional 
strength to throw off the attack while the minority were weak enough to be- 
come overtly ill. The solution lay in forbidding people with a history of overt 
tuberculosis in their pedigrees to marry; thus we could artificially speed up the 
Darwinian process of natural selection, and eventually breed a resistant stock. 
We smile at this, because we know that mortality from tuberculosis has since 
been reduced to a tiny fraction of what it was—with the aid of rest, diet, drug 
treatments, surgery, etc. Pearson was wrong, of course, in his conclusions, but 
was he necessarily wholly wrong in some of his premises? Do we know that 
there are not individual differences in body chemistry, which may indeed be 
inherited, that affect ability to cope with invading micro-organisms? We per- 
haps never shall know with respect to polio, for example, because happily the 
triumphs of immunology may make exposure to that disease as rare as exposure 
to smallpox. 

By the same token, let us ask whether we may not be closing our eyes to 
constitutional differences in temperament and in ability, simply because they 
are not as manipulable as are differences attributable to learning and culture. 
The human plasticity is so great that sociologists and psychologists have plenty 
of room for manipulation. But if our thinking and our research should lead 
us to treat as constants all that which is not easily manipulable, we may some- 
time be in serious scientific difficulty. What is most easily manipulable may 
properly be the first target of scientific attack, but ease of manipulability is 
not the only admissible criterion in the strategy of the scientist. Moreover, on 
moral as well as scientific grounds the eugenicists have not spoken their last 
word. I am sure this is the case, in spite of the untenability of many of their 
initial assumptions—for example, the assumption of a Mendelian unit char- 
acter for intelligence made by Davenport, which assumption, incidentally, was 
relentlessly attacked by Pearson. 

I suggest, of course, no such idle watchword as “back to Pearson”—even if 
it might be slightly excusable on an occasion in which sentiment is permissible. 
Some of Pearson’s statistical thinking is still as current as the latest automatic 
Monroe calculator or IBM 650. Some is as obsolete as his beloved Brunsviga. 
We are not going back to the Brunsviga when something better is available. 
Nor, even if we who are sociologists can profit by asking why we are now over- 
looking variables which to Pearson were of supreme importance, are we likely to 
go back to his strategy of sociology. But there is much to gain in pausing 
occasionally to review the steps by which science has progressed. And there 
should be inspiration to all of us when disciplines like ours take time out to 
salute the memory of one of the noblest of our pioneers. ; 





SMOKING AND LUNG CANCER: SOME OBSERVATIONS 
ON TWO RECENT REPORTS 


JosEpH BERKSON 
Mayo Clinic, Rochester, Minnesota 


“I do not reject the use of statistics, but I condemn not trying to go beyond 
them ....” CLaupE BERNARD 


H® opinion that smoking is a cause of cancer of the lung, expressed by a 
 aianer of writers and reflected in some recent public announcements 
[4, 16], rests to a considerable degree, though not entirely, on two now cele- 
brated “prospective” statistical studies, one reported by Doll and Hill [5, 6], 
the other by Hammond and Horn [10, 11]. In a commentary article in 1955, 
[1] I pointed out that the results presented in the first report of Hammond and 
Horn [11] exhibited an association of smoking with increased death rates, not 
only for cancer of the lung, but also for each of the other several categories of 
cause of death into which all causes had been segregated, namely, cancer ex- 
clusive of lung, diseases of the coronary arteries, and other diseases. In the sec- 
ond report of these authors, the association of smoking with the death rates 
from causes other than lung cancer is shown even more clearly. In the accom- 
pany ing figure [page 35], which is reproduced from their recent report [10], it 
is seen that, of the total estimated excess deaths attributed to regular cigarét 
smoking, only 13.5 per cent were from lung cancer. Disease of the coronary 
arteries accounted for about 52 per cent of the excess deaths, cancer other than 
lung cancer for 13.5 per cent, other heart and circulatory disease for 5.8 per 
cent, noncancerous pulmonary disease for 5.6 per cent, cerebral vascular disease 
for 4.8 per cent, peptic ulcer for 2.8 per cent, and cirrhosis of the liver for 1.5 
per cent. 

When I wrote my article, only the first report of the prospective study of 
Doll and Hill had been published, and in their results, based on rather few 
deaths, a similar generalized association of smoking with death rates from 
causes other than cancer of the lung was not evident. In their second report, 
based on a longer follow-up, and consequently on more reported deaths, a 
generalized association similar to that found in the study of Hammond and 
Horn can be seen. 

If what we wish to observe, when it is present, is evidence that smoking in- 
creases the probability of death from this or that disease, a simple and effective 
procedure, commonly used, is to examine an extreme, for instance in this case, 
the experience of the heavy smokers. It is reasonable to say: “If smoking in- 
creases the risk of death, it should affect heavy smokers most; the first thing 
to do is to see whether the death rates of the heavy smokers are higher than the 
rates of those who are less than heavy smokers.” One would not limit oneself 
to this comparison, of course, for a complete analysis of the character of the as- 
sociation, but for the particular purpose of discerning whether association does 
exist, this suggests itself as a sensitive method. In table 29, the death rate 
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TABLE 29 


DEATH RATES OF HEAVY SMOKERS COMPARED WITH THOSE 
OF LESS THAN HEAVY SMOKERS 














Other | Coro- 
All Lung Other | respira-| nary Other 
causes | cancer | cancer tory throm- | causes 
diseases | bosis 
Standardized death rate per 
year per 1,000 males* 
Nonsmokers 13.25 0.07 2.04 0.81 4.22 6.11 
Daily average of 1-14 gm. 14.92 0.47 2.01 1.00 4.64 6.82 
Daily average of 15-24 gm. 14.49 0.86 1.56 1.11 4.60 6.38 
Daily average of 25+ gm. 18.84 1.66 2.63 1.4 5.99 7.19 
Difference, rate of heavy smok- 
ers and rate of: 
Nonsmokers 5.59 1.59 0.59 0.60 Rey 1.08 
Daily average of 1-14 gm. 3.92 1.19 0.62 0.41 1.35 0.37 
Daily average of 15-24 gm. 4.35 0.80 1.07 0.30 1.39 0.81 
Mean difference 4.62 1.19 0.75 0.43 1.50 0.75 
Difference of rate, per cent of 
all causes 
Nonsmokers 100.0 28.4 10.6 10.7 31.7 19.3 
Daily average of 1-14 gm. | 100.0 30.4 15.8 10.5 34.4 9.4 
Daily average of 15-24 gm. | 100.0 18.4 24.6 6.9 32.0 18.6 
Mean difference 100.0 25.7 16.2 9.3 32.5 16.2 




















* From table V of Doll and Hill [5]. The death rate from all causes is less than the sum of the rates for the 
specific causes because of the inclusion of three cases of lung cancer in two categories of causes. 


of heavy smokers (25+ gm. daily average) is compared, for each of five cate- 
gories of cause of death listed by Doll and Hill [5] in their Table V, with the 
death rate of nonsmokers, that of light smokers (1-14 gm. daily average), and 
that of moderate smokers (15-24 gm. daily average). 

It is to be noted that the heavy smokers have a higher death rate than the 
nonsmokers for each category of cause of death listed, and that the difference 
in death rate is largest, not for lung cancer, but for coronary thrombosis. Com- 
paring the death rate of the heavy smokers with that of the light smokers, 
again we find that the death rate is higher for the heavy smokers with each 
category of cause of death, and that the difference in death rate is largest, 
not for lung cancer, but for coronary thrombosis. And comparing the death 
rate of the heavy smokers with that of the moderate smokers, we find that 
the rate is higher for the heavy smokers with each cause of death, and that the 
difference is largest, not for lung cancer, but for coronary thrombosis, while 
the next largest difference is not for cancer of the lung either, but for cancer 
other than lung cancer. Even as the death rate among the heavy smokers is 
highest in each of the five categories of causes of death, so it is true also, that 
for all the categories except cancer other than lung cancer, the death rate is 
smallest among the nonsmokers. 

Doll and Hill measured the differences in death rates among the different 
classes of smokers relatively, sometimes in relation to the death rate among ali 
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men, sometimes in relation to the death rate among nonsmokers. It seems al- 
most instinctive to express differences relatively in some such manner, but in a 
case like the present application, it may be misleading. There are, of course, 
situations in which such relative measures are appropriate, but it does not ap- 
pear to me that this is such a situation. When there is reason to believe that an 
increment dy, by the mechanism which produces it, is proportional to y, then 
we will measure the force producing dy in proportion to éy/y. In the present 
case dy is the increased number of deaths associated with smoking (in a stand- 
ard number of the population, in a year), and we must consider whether y 
can be the total number of deaths, or the number not attributed to smoking. 
Smoking, when increasing deaths, if it has such an effect, produces this effect, 
in the first instance, by acting on the living; why, then, should its potency be 
measured in relation to the number of deaths? Indeed, conceivably, it might 
work the other way around. If, for instance, heart disease, independently of 
smoking, causes a greater number of deaths than lung cancer does, then 
smoking would have fewer susceptibles to kill with heart disease than with 
cancer of the lung. In that case, a given number of deaths from heart disease, 
attributed to smoking, would reflect a greater mortal force for heart disease 
than the same number of deaths would reflect for cancer of the lung. I have 
not been able to think of a reasonable theory of the action of smoking in causing 
death, for which it would be appropriate to measure the increased death rate 
from a specified disease, attributed to smoking, as a proportion of the total 
death rate from that cause. The exclusive use of this index, in the present 
instance, is open to criticism in particular, because it tends to bolster up the 
theory being advanced—the association of smoking and lung cancer—and to 
tone down the association with other causes of death. For instance, the death 
rate for lung cancer is about a sixth that for coronary thrombosis. Therefore, 
when measured relatively, a given number of deaths from lung cancer appears 
six times as large as the same number of deaths from coronary thrombosis. 
And, of course, from a strictly practical viewpoint, it is only the total number 
of increased deaths that matters. In the considerable literature that has devel- 
oped about smoking and lung cancer, it is not the enormous import for biology 
of a discovery that smoking can cause cancer that has been emphasized— 
hardly a word about that. Instead there has been great advertisement of the 
social evil of causing so many people to die—and something should be done 
about that.' In these circumstances, it is especially to the point, that the number 
of deaths should not have been obscured, in a preoccupation with the ratio of 
deaths. 

Thus, on the basis of a comparison of the death rates of heavy smokers with 
those who are less than heavy smokers, it is clearly indicated in these data 
that smoking, or at least heavy smoking, increases the probability of death 
from every cause, so far as these causes are represented in the five categories 
listed by Doll and Hill. If the mean of the differences of death rates between 
heavy smokers and less than heavy smokers is taken as an index, the dif- 





1 What is to be done, it appears, is to be limited to hortatory efforts at dissuading people from smoking cigarets. 
No program has been initiated, for inst , to develop an immunising vaccine, a step that, since Pasteur, has 
typically followed the identification of a specific cause of a disease. 
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ference of death rate from all causes is 4.6 (Table BA). Of this difference, the 
difference of death rate for coronary thrombosis constitutes 32 per cent, while 
the difference in death rate for cancer of the lung constitutes only 26 per cent. 
But none of the other categorized causes of death fails to contribute to the 
difference of rate from all causes; other cancers than lung cancer contribute 16 
per cent, other respiratory diseases 9 per cent, and “other causes” 16 per cent. 
Doll and Hill emphasize in their discussion that, due to the attenuation of the 
data, any relation found “will be an understatement of the true relationship,” 
and this seems to be a sound inference. Accordingly, the associations described 
above between smoking and each of the categorized causes of death can be 
considered an understatement of the true association between smoking and 
the death rate from causes represented in each of these categories. 

Are these associations “statistically significant”? We are not concerned, at 
the moment, with whether there is association with some specified category of 
disease, but with the validity of the evidence that there is association with 
disease in general. Appropriate here is some form of permutation test of the 
kind being developed by Box and Andersen [2]. However it is figured, the 
probability of getting by chance, if there were no generalized association with 
all or many of the separate categories of disease, consistently higher death rates 
among the heavy smokers than among any of the three categories of less than 
heavy smokers, in each of five predesignated categories of cause of death, and 
in agreement with the independently obtained similar finding in the prospective 
study of Hammond and Horn, must be considered negligible. The appearance 
of a generalized association in the first report of Hammond and Horn, not 
clearly evident in the first report of Doll and Hill, is reasonably ascribable to 
the larger number of cases in the former study.? As already noted, with more 
death notices available in the second report of Doll and Hill, an association of 
smoking in all five categories of cause of death became discernible. When Doll 
and Hill present later reports, based on more ample data, such associations 
may be exhibited with even greater clarity. 

The existence in the data of both these prospective studies, of association 
between smoking and death rate from many categories of cause of death other 
than lung cancer, and indeed chiefly with these other categories, raises a serious 
challenge to the explanation that the statistical association observed between 
smoking and lung cancer has a direct causal basis.* Some physical explanation 
must be found for the other associations, unless statistics is to be exposed to 





2 The total number of men in the study of Hammond and Horn was about 188,000; the number in the study 
of Doll and Hill was about 35,000. In the 53 months covered by the study of Doll and Hill [5] there were 1714 deaths 
or 388 deaths per annum. In the 20 months experience of the report of Hammond and Horn [11] there were 4854 
deaths or 2912 deaths per annum, relatively over 7 times as many. Considering the much greater numerical defin- 
itiVeness of the study of the American Cancer Society, it is regrettable that it has been decided to terminate that 
study with its second report. In follow-up studies of this kind, it is frequently the late results which are the most 
revealing. 

3 In any critical analysis of statistics claiming to establish a causative association between disease and an en- 
vironmental factor, the great importance of the disease specificity has rightly been stressed by Yerushalmy and 
Hiileboe [17]. The report of the Medical Research Council of England [13a] on cancer of the lung, seems, in part, to 
recognize this, when it says, referring to the retrospective investigation of Doll and Hill [6] “Analysis of the histories 
and habits of the patients with various diseases revealed only one striking contrast—the difference in the smoking 
habits of those with and those without lung cancer (ital. inserted).” Yet the report later refers to the study of Ham- 
mond and Horn [11], in which the association with smoking, so far from being specific for lung cancer, is evident 
for each of the disease groups categorized; but the report makes no note of the fact. 
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the charge of scientific irresponsibility.‘ It would assist materially in under- 
standing the analysis of Doll and Hill, and that of Hammond and Horn, if we 
had answers, as clear and unequivocal as the complexity of the problem per- 
mits, to the following questions: 

1. Is their explanation of the association of smoking with lung cancer, at 
least for a large proportion of the estimated excess deaths among smokers from 
this cause, about as follows? Cigaret smoke contains carcinogenic substances 
that, by contact with the bronchial and pulmonary tissues over a sufficiently 
long time, induce changes resulting in cancer. 

2. Is their explanation for the association shown with cancers other than 
lung cancer, including cancer of such sites as the colon, stomach and pancreas, 
also that the relation is causal? If so, what in general terms, is the mechanism 
of this causation? 

3. Do they think that the association with the death rate from coronary 
heart disease, which has the largest number of estimated excess deaths associ- 
ated with smoking or heavy smoking of any disease category, is causal? If not, 
what is their explanation of the association? 

4. Do they have any explanation of the association of smoking with the 
death rate for the miscellaneous category of diseases, which excludes cancer, 
and cardiovascular disease and noncancerous respiratory disease? 

For myself, I find it quite incredible that smoking should cause all these 
diseases. It appears to me that some other explanation must be formulated for 
the multiple statistical associations found with so wide a variety of categories 
of disease. And if we are not crassly to violate the principle of Occam’s razor, 
we should not attribute to each separate association a radically different 
explanation. 

One explanation is that the associations have a constitutional basis. This 
hypothesis has been formulated—and rejected—by Doll and Hill [5] in the 
following terms: “...it has been suggested that constitutional and psy- 
chological factors might have such an effect ... that persons of a certain 
‘make up’ are peculiarly liable to lung cancer and to smoke. We know of no 
published evidence to this effect.” The hypothesis is stated more plausibly, it 
seems to me, to the effect that persons who are nonsmokers, or relatively light 
smokers, are of a constitutional type that is biologically disposed to self- 
protective habits, and that this is correlated generally with constitutional resist- 
ance to mortal forces from disease. If 85 to 95 per cent of a population are 
smokers,® then the small minority who are not smokers would appear, on the 
face of it, to be of some special type of constitution. It is not implausible that 
they should be on the average relatively longevous, and this implies that death 
rates generally in this segment of the population will be relatively low. After 
all, the small group of persons who successfully resist the incessantly applied 
blandishments and reflex conditioning of the cigaret advertisers are a hardy 
lot, and, if they can withstand these assaults, they should have relatively 





se 
« Cancer is a biologic, not a statistical, problem. Statistics can soundly play an ancillary role in its elucidation. 
But if biologists permit statisticians to become the arbiters of biologic questions, scientific disaster is inevitable. 
5 In the retrospective study of Doll and Hill [7], less than 5 per cent of the controls were nonsmokers. In 
the prospective study [6} only 13 per cent were nonsmokers. In the prospective study of Hammond and Horn [11], 
17 per cent were r ded as kers (lifetime history). 
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little difficulty in fending off tuberculosis or even cancer! If it seems difficult 
to visualize how such a constitutional influence can carry over to manifest 
itself as a graded increase of death rate with a graded increase of intensity of 
smoking, then we must remember that we are wandering in a wilderness of 
unknowns. I do not profess to be able to track out the implications of the con- 
stitutional theory or to defend it, but it cannot be disposed of merely by fiat 
denial.® 

Another explanation that must be considered seriously is also referred to by 
Doll and Hill [5] in their recent report, when they said: “If the association 
suggested by the upper part of the table’ were due merely to a bias in our meth- 
od of investigation, we would expect to see that bias operating to some extent 
in all, or nearly all, causes of death. It does not appear to do so.” Although the 
wording of the quoted passage makes its exact meaning somewhat obscure, 
I take it to imply that a show of association of smoking with the death rates 
from all or nearly all causes of death would, in so far, be evidence that there 
may be a bias in the method of investigation. Does the finding in the data 
of Doll and Hill of association in all of the five categories of disease of their 
Table V, and the association with the more specific causes shown in the data 
of Hammond and Horn, satisfy the requirement of “all or nearly all diseases”? 
So far as the data of Doll and Hill are concerned, the relatively small numbers 
of deaths reported render minute examination of specific causes of death not 
very profitable; however, by gathering together data from their Tables V, 
XIII and XVI, the death rates for the different classes of smokers may be 
examined in 15 categories of cause of death (Table 34). The death rate for the 
heavy smokers is higher than that for the nonsmokers in 12 of the 15 categories, 
although in several instances the number of deaths, the difference of rates, or 
both, are small. It is notable that in the category containing cancer of other 
sites (other than respiratory, gastrointestinal and prostate), there were 88 
deaths (compared with 84 deaths from cancer of the lung), and this group 
shows a graded increase of death rate with increased amount of smoking, from 
arate of 0.64 for nonsmokers, to a rate of 1.02 for heavy smokers. There were 77 
deaths by violence, almost as many as for lung cancer; while the death rates 
for violence do not increase monotonically with increasing intensity of smoking, 
the death rate is the highest for the heavy smokers (0.90), and lowest for the 
nonsmokers (0.42). 

In the much more ample data of Hammond and Horn [10], the categories 
represented in the figure [page 35], in all of which association is exhibited, are 
fairly narrow. But even more to the point, their detailed data show association in 
many individual causes of death [9]. Indeed, in those data, it is more a matter 
of singling out the causes for which no association is evident, than of pointing 
out the causes for which there is association.’ Unless a case can be made out for 





* The rejection of the hypothesis of constitution by Doll and Hill, cited above, had reference to the association 
of smoking specifically with lung cancer. Doll and Hill may be more sympathetic to it in the light of the disclosure 
of the generalized associations. 

1 Table XVII. It gives the standardized death rates for a summation of groups of causes of death that, it is 
said, have from time to time been’regarded as possibly'related'to smoking. For this group of causes of death, taken 
together, the death rate increases monotonically with increase of smoking rate. 

* Recently, a number of independent studies have appeared reporting an association of smoking with diseases 
other than cancer of the lungs, such as tuberculosis [13], cancer of the bladder [12], prematurity of births and thy- 
roidal disorders [15]. 
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TABLE 34 


DEATH RATES FOR VARIOUS SMOKING CLASSES FOR INDIVIDUALIZED 
CATEGORIES OF DISEASE: REPORT OF DOLL AND HILL{5} 
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assuming that there is bias in one rather than tie other of the two prospective 
studies, it seems reasonable to consider them both together; when this is done, 
there can hardly be any doubt that association is shown for “all or nearly all 
causes of death.” And, following Doll and Hill, this is indicative of the possible 
presence of bias. 

I do not like the term “bias,” since it may seem to imply conscious deception, 
though of course Doll and Hill certainly did not mean to imply that. However, 
when an investigation set up to test the theory, suggested by evidence pre- 
viously obtained, that smoking causes lung cancer, turns out to indicate that 
smoking causes or provokes a whole gamut of diseases, inevitably it raises the 
suspicion that something is amiss. It is as though, in investigating a drug that 
previously had been indicated to relieve the common cold, the drug was found 
not only te ameliorate coryza, but to cure pneumonia, cancer, and many other 
diseases. A scientist would say, “There must be something wrong with this 
method of investigation.”® It would be highly speculative, and invidious, to 
attempt to identify specifically what the character and mechanism of entry 
of such a bias are. However, it must be pointed out, that the observations con- 
stituting the two main variables under study, namely a history of smoking, 





* However, a modern statistician, practiced as he is in dialectics, may reply, “The fact that the drug cures 
cancer does"not disprove that it helps*colds.” 





SMOKING AND LUNG CANCER 
EXCESS DEATHS AMONG MEN WITH A HISTORY OF REGULAR CIGARET SMOKING 


Coronary Artery Disease: 1,388 YH YMMVqH[} i,_—s«52-1\% 
Lung Cancer; 300 Y,_—‘13-5% 

Other Cancer: 39 Yi, 13.5% 

Other Heart & Cire.: 5.8% 

Pulmonary (Exc. Ca.): | WA >-®% 

Cerebral Vascular: YY, 4-®% 

Gastric & Duod. Ulcers: Yn 2.8% 


Cirrhosis of Liver: 1.5% 


All Other: 0.4% 


Total 2,665 


The number of excess deaths attributed to regular cigaret smoking, with the percentage 
of the total deaths attributed to each cause listed. (Reproduced from figure of Hammond 
and Horn.) 


and the cause of death, are subject to considerable and even extreme error,’ 
and the samples are selected, possibly highly selected.“ In the presence of a 
combination of such diversionary complications, we may expect “almost any- 
thing” to happen. 

If the observed statistical association between smoking and the death rate 
from many causes including cancer of the lung is not spurious, that is, if it is 
not a vagary of the data, and the “constitution” explanation is considered un- 
convincing, another explanation that may be offered is that smoking increases 
the “rate of living” (Pearl) [14]. Actually I do not know of any independent 
evidence for such an effect of smoking. It is Pearl’s concept, and the findings in 
the studies relating smoking to death rates, that suggested it In their first 
report, Hammond and Horn [11] made the interesting observation that the 
general death rate among smokers, is similar to that among nonsmokers of 
more advanced age, and this observation is consonant with the proposed 
theory. To be sure, nothing is known about the properties of tobacco smoke 
that would produce such an effect, or what specifically are the physicochemical 
alterations in the cell that constitute such a change. But neither is anything 
known about the physicochemical processes that constitute normal aging. It 
is conceivable that smoking over a long period creates an altered internal 
environment for the body cells, which modifies the rate of those biologic proc- 
esses that affect the aging of tissues. But the most important consideration 





10 J have a study of this question under way. 

u In the investigation of Doll and Hill, about 30 per cent of the physicians to whom questi ires on ki 
were sent did not reply or sent a reply insufficiently complete to be utilized. In a case like the study of Hammond 
and Horn, the reference population is not definable. The amount of selection, therefore, cannot be estimated, and 
the problems of selection discussed by Brownlee [3] become more difficult of evaluation. 
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with respect to a theory is not whether it appears plausible, but whether it 
suggests experiments, and what experiments are suggested. This theory does 
bring to mind experiments, but not those consisting of the applicat:on of prod- 
ucts of tobacco smoke to tissues followed by observation to discern whether 
cancer develops. It suggests a program of what might be called the experi- 
mental epidemiology of the trophic (“chronic degenerative”) diseases. I should 
take as a model the famous program of experimental epidemiology of Green- 
wood, A. Bradford Hill, Topley and Wilson [8], but the interest would be, not 
in infectious diseases and their mode of spread, but in diseases of no known 
specific cause, and their occurrence rates under different environmental condi- 
tions. Obviously, this is not the place to try to present the details of such an 
experimental program. Its scope and general methodology can be gleaned by 
reference to the work just cited. 


SUMMARY AND REMARKS 


Firm opinions have been published to the effect that, on the basis of accumu- 
lated evidence, it is scientifically established beyond reasonable doubt, that 
smoking is an important cause of cancer of the lung. I am a member of a com- 
mittee that has sponsored some of the most important of the published studies 
and, owing to this circumstance, have felt the responsibility to make a fairly 
careful study of this evidence. My own conclusions are quite different. 

In the first place, virtually all the evidence is obtained from statistical 
studies, in the ordinary connotation of the term “statistical.” We are not deal- 
ing with the results of laboratory experiments, or even with placebo-controlled 
clinical trials. Nor is the conclusion based on a synthesis, by a “chain of reason- 
ing,” of relevant scientific knowledge from many different sources.“ Such 
statistical evidence, for a question like the identification of a cause of a dis- 
ease, at best, can be only presumptive. But even as statistical investigations, 
I do not find the published studies so sound or convincing as they apparently 
have widely been assumed to be. In the studies that have been called “retro- 
spective,” as well as in those called “prospective,” I find questionable and even 
paradoxic elements. 

In the present brief communication, I have referred to such a parodoxic 
element in two prominent prospective studies. These studies were projected 
as a check on the theory that smoking causes cancer of the lung, a theory de- 
rived from previous observations pointing to that specific conclusion. What the 
prospective studies actually revealed was an association of smoking, not 
specifically with lung cancer, but with a wide variety of diseases, including 
diseases that never have been conceived to have the same etiology as cancer. 
It is not logical to take such a set of results as confirming the theory that tobacco 
smoke contains carcinogenic substances which, by contact with the pulmonary 
tissues, initiate cancerous changes at the site of contact. Nor is it wise to look 
aside from positive findings that do not neatly fit the simple theory that initi- 





12 As was the case for instance with the investigation of cholera by Snow, (Frost) [7a]. In this connection it 
is pertinent to quote the opinion of Stocks and Campbell [15a]. “Certainty about the causation of lung cancer is 
unlikely to result from any single research, whether chemical, experin.vntal, clinical, or statistical; most probably 
conclusions will have to be reached on the grounds of consistency bet» sen data and by pieving together evidence 
from diverse sources. ... * 
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ated the investigation. Something more important may have been discovered 
than was first conceived. The results suggest that in data which have been 
obtained as these data have been collected there are factors which produce a 
statistical association between smoking and the death rate from disease gener- 
ally. I have suggested for consideration three explanations. 

1. The observed associations are “spurious,” that is, they have no biologic 
significance, but are the result of the interplay of various subtle and compli- 
cated “biases.” The definitive variables, namely a history of smoking and the 
cause of death, are, as observations, subject to considerable error, and the 
samples, not having been obtained (or obtainable) by scientific sampling 
methods, are “selected.” In order to determine the effect of such errors and 
selection, it will be necessary to make independent investigations, and it may 
be difficult to plan and carry out such studies. At any rate, the possible effect 
of these factors is not determined scientifically by speculative reflections on 
what seems “unreasonable,” or by declarations that this or that possible effect 
appears “farfetched.” What is required is investigation. 

2. The observed associations have a constitutional basis. Persons who are 
nonsmokers, or relatively light smokers, are the kind of people who are bio- 
logically self-protective, and biologically this is correlated with robustness in 
meeting mortal stress from disease generally. 

3. Smoking increases the “rate of living” (Pearl), and smokers at a given age 
are, biologically, at an age older than their chronologic age. As a result, smokers 
(in particular, heavy smokers) are subject to the death rates of nonsmokers or 
relatively light smokers who are chronologically older. Diseases like cancer 
and heart disease, the death rates for which have a pronounced gradient with 
age, will be considerably more prominent in heavy smokers than in nonsmokers 
or relatively light smokers of the same age. A particular significance of this 
theory is that it suggests a novel program of experiments for verification— 
what might be called the experimental epidemiology of noninfectious diseases. 
In this theory, it would not be expected that experiments with inhalation of 
tobacco smoke in animals specifically would produce lung cancer. The effect 
would be manifested in increased metabolic and vital activities, some of them 
possibly salutary, while at the same time death rates would be higher from 
diseases generally, especially the “degenerative” or trophic diseases, which 
include cancer and cardiovascular diseases. 
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AN EXPERIMENT WITH WEIGHTED INDEXES 
OF CYCLICAL DIFFUSION 


Bert G. HickMAN 
The Brookings Institution 


Diffusion indexes measure the extent of common participation of the 
various sectors of the economy in expansions and contractions of ag- 
gregate activity, and are used to study the relationships between the 
degree of participation and other characteristics of business cycles, such 
as turning points and amplitudes. The usual index takes account only 
of the direction of change of the components. An experiment was under- 
taken to determine if the behavior of a diffusion index of employment 
would be affected when it was weighted to allow for the size of the com- 
ponent industries, the amplitudes of the individual fluctuations, or the 
combined influence of size and amplitude. Size weighting made little 
difference, because the average size of industries in the expanding and 
contracting groups remained fairly constant despite substantial fluctu- 
ations over time in the number of industries contained in the two groups. 
Amplitude weighting exaggerated the month-to-month movements of 
the unweighted index, because the forces which made for an increase 
of diffusion also tended both to amplify the average increases in rising 
industries and to dampen the average declines in falling industries. The 
economic implications of the findings are discussed and several sug- 
gestions are offered about the use of diffusion indexes in the search for 
the causes of business cycles. 


HE National Bureau of Economic Research has long had an interest in the 

diversity of experience which lies beneath the comparatively smooth cycli- 
cal fluctuations revealed by aggregative measures of economic activity. Indeed, 
the interest may be said to antedate the Bureau itself. Wesley Mitchell’s 
earliest book on business cycles, published in 1913, emphasized the variety of 
processes by which expansionary or contractionary tendencies originating in 
particular sectors fan out through the economy and cumulate into generalized 
upswings and downswings, only to be reversed as more and more sectors en- 
counter opposing forces, until finally the over-all balance is tipped in the other 
direction.! The same broad conception of “ ... business cycles as a sequence 
of phases each of which is & highly complex aggregate of conditions in different 
industries—conditions which are never strictly uniform, and which are at times 
markedly divergent” reappeared in 1927 in his next major work on the sub- 
ject.2 The topic again recurred in Burns and Mitchell’s Measuring Business 
Cycles (NBER, 1946), notably in the sections of Chapter 4 dealing with 
“Diffusion of Specific Cycles” and “Dependability of the Reference Dates.” It 
was not until 1950, however, that the Bureau began to publish summary 
measures of the phenomenon in the form of the diffusion indexes which have 
become so familiar since that date. Both New Facts on Business Cycles, by 
Arthur F. Burns, and Statistical Measures of Cyclical Revivals and Recessions, 

1 Cf. Wesley C. Mitchell, Business Cycles, Berkeley, California, 1913, especially Part III. 


2 Business Cycles, The Problem and Its Setting, National Bureau of Economic Research, New York, 1927. The 
quotation is from page 456. 
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by Geoffrey H. Moore, appeared in that year, and both papers contained dif- 
fusion indexes.’ The titles are significant, for they indicate the twofold use to 
which diffusion indexes may be put—analysis of business cycle phenomena on 
the one hand and short-term business forecasting on the other. Much of the 
published work on the indexes has stressed the latter use, but it is to the former 
that the present paper is primarily directed. 

A diffusion index measures the proportion of time series in a given collection 
which are expanding at a given time. If in a group of 100 series, 80 were in- 
creasing and 20 decreasing in a particular month, the index value for that 
month would be 80 per cent. Since the only information that is needed about 
any series is its direction of change, a diffusion index may be constructed for 
a collection of series which is heterogeneous with regard to the aspect of eco- 
nomic activity measured or the unit of measurement. Several of the early 
diffusion indexes were based upon such heterogeneous collections, although 
indexes were also presented for groups which were homogeneous with respect 
to activity measured (production, orders, employment) or timing relationship 
to business cycles (leading, roughly coincident, or lagging activities).‘ 

Another characteristic of many of the original indexes was that the direction 
of change of each component series was determined by its cyclical phase rather 
than by its actual change in a given month. Thus, once the cyclical troughs 
and peaks of each individual series had been dated according to the National 
Bureau procedures,’ which are designed to discriminate between cyclical and 
other movements, the series was counted as expanding in every month between 
a trough and peak (including the peak month) and as contracting in every 
month between peak and trough (including the trough month). All intra-phase 
reversals of direction were smoothed out. 

Diffusion indexes constructed according to the plan sketched in the pre- 
ceding paragraph have certain interesting properties. They reveal cyclical 
fluctuations which lead the business cycle, in the sense that the peaks and 
troughs of the diffusion index lead the peaks and troughs of business cycles as 
dated in the chronology of the National Bureau or as traced out by such 
indicators of aggregate economic activity as industrial production or factory 
employment.* The indexes also display a marked tendency to cross the 50 per 
cent level at about the same time that the peaks and troughs of the busines 
cycle are reached, provided, of course, that the indexes are not based upon 
groupings of series which tend consistently to lead or lag the business cycle. 
Together, these properties suggest that the scope (as defined by the proportion 
of expanding activities) of a business expansion tends to diminish before the 
peak in over-all activity is reached; that general business contraction sets in 
at about the time that the number of expanding activities changes from a 
majority to a minority; that the proportion of expanding activities reaches a 
minimum and begins to rise before the trough in general business is reached; 





* The first paper is published in the 30th Annual Report of the National Bureau of Economic Research (1950) 
and is reprinted in Arthur F. Burns’ The Frontiers of Economic Knowledge, Princeton University Press, Princeton, 
1954. The second is Occasional Paper 31 (1950) of the same organisation. 

4 See the papers by Burns and Moore cited above. 

* Burns.and Mitchell, op. cit., pp. 56-66. 

* Burns, op, cit., p. 9. 
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and that a new cyclical expansion develops at about the time that the expand- 
ing activities again attain a majority. 

These timing relationships, be it noted, are largely independent of the com- 
position of the indexes, applying equally well to those constructed from homo- 
geneous components of a given aspect of economic activity or from hetero- 
geneous series, provided that appropriate modifications are made for the timing 
relationship to the business cycle of the activity upon which a homogeneous 
index is based. The leading properties of the heterogeneous indexes are not due, 
then, simply to the fact that some activities, such as the placement of new or- 
ders, would be expected to lead others, such as production. There is industrial 
as well as process diffusion—a diffusion index developed from the components 
of an index of industrial production will lead the production index. It will also 
lead the business cycle, since industrial production conforms closely to the busi- 
ness cycle as defined and measured by most observers. 

Once these properties had been noted, they suggested the exciting possibility 
that diffusion indexes might be used to forecast cyclical turns in aggregate 
activity. For this to be done, however, the directions of change of the com- 
ponents of the index must be determined by some other device than dating the 
cyclical turning points, since the most recent of the latter can usually be known 
only by hindsight. Several approaches have been tried by the National Bureau 
and by others: counting the actual direction of change of a series in a given 
month, but weighting it in proportion to the number of immediately preceding 
consecutive months of change in the same direction; smoothing the original 
data by moving averages and basing the directions of change on the smoothed 
series ; and simply counting the actual change shown each month by the original 
(seasonally adjusted) series. An example of the last type of index is graphed as 
the third series from the top in Chart 42; it is an index of diffusion of employ- 
ment of production workers in manufacturing industries. It will serve to illus- 
trate some of the problems of using “current” diffusion indexes in forecasting 
work. 

First, it will be noted that the short-term trend of the index is often obscured 
by irregular month-to-month fluctuations. Secondly, although the index did 
lead the business cycle turns of November 1948, October 1949, July 1953 and 
August 1954, it will be seen that it also “predicted” several turns which failed 
to develop. There were, indeed, minor oscillations in total employment of pro- 
duction workers in manufacturing (top line of Chart 42) corresponding to 
the “false” movements of the diffusion index, but even these did not take on a 
“cyclical” aspect. 

Our present concern is with other problems, however. If the reader will 
compare lines 2 and 3 of the chart he will note a marked similarity between the 
two curves. The series plotted on line 2 is the absolute monthly rate of change 
of total production worker employment. A similar correlation between the 
monthly rate of change of an aggregate and a diffusion index derived from the 
monthly changes of direction of the components of the aggregate has been ob- 
served in other cases.’ With regard to industrial production, for example, 





1 Geoffrey H. Moore, “Diffusion Indexes; A Comment,” The American Statistician, October 1955, p. 13. 
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Vertical lines indicaie business cycle peaks and troughs as dated by the National 
Bureau of Economic Research. 
The diffusion indexes arc based upon 21 two-digit manufacturing industries. 


Cuart 42. Number of production workers in manufacturing, total, monthly change, 
and diffusion indexes (seasonally adjusted). 


Arthur Broida established the correlation and went on to argue that “forecasts 
of the production index could be made about as readily” from the rate of change 
curve as from the diffusion index, and “not very readily from either.”* The 
force of this observation may be judged by the reader, since the same statement 
could be made as suitably about the relationship among the first three curves 
of Chart 42. 

But why should there be a correlation between the diffusion index and the 
rate of change of the aggregate? This is an intriguing question about the anat- 
omy of business cycles, and one which is quite independent of the forecasting 
issue. As Broida points out, the rate of change of an aggregate is affected not 
only by the proportion of expanding to contracting components, but also by 
the relative sizes and relative amplitudes of the components in the two classes. 





* “Diffusion Indexes,” The American Statistician, June 1955, p. 15. Broida used the percentage rate rather than 
the absolute rate of change for the comparison. 
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The fact that a positive correlation exists suggests that the last mentioned 
factors do not nullify the tendency for wider diffusion to raise the aggregate 
and narrower diffusion to lower it; but the fact that the correlation is not per- 
fect suggests that a detailed investigation of the relationships among size, 
amplitude and diffusion of the components and fluctuations in the aggregate 
may be rewarding. 

Let us put the rate of change of the aggregate aside for the moment, and 
ask how we might go about constructing a weighted diffusion index—perhaps 
it is not wise to ignore the relative size of a component or its relative amplitude 
in the construction of the indexes. With regard to amplitude, the scheme that 
comes most readily to mind is to total the amplitudes of all expanding series 
and to express the resulting figure as a percentage of the sum of the combined 
amplitudes of all expanding series and the combined amplitudes (disregarding 
sign) of all contracting series. Thus, in the example of a series expressed in 
original units like factory worker employment, if one industry increased em- 
ployment by 20 per cent and two others experienced declines respectively of 
5 and 25 per cent, the “amplitude weighted” diffusion index would be 20 
divided by 50. This value of 40 per cent compares with one of 33 per cent 
for the standard unweighted (or equally weighted) index. It will be noted that 
the two indexes would be the same only if the average amplitude of the ex- 
panding series were equal to the average amplitude of those contracting. At 
least, that would be true were it not for a technicality of the usual method of 
construction of unweighted indexes. If there should be two more industries in 
the group and if they happened to experience no change, they would be con- 
sidered to fall one-half in the expanding and one-half in the contracting groups, 
so that the unweighted index would be computed as 2 divided by 5, or 40 per 
cent. The corresponding amplitude weighted index would be unaffected by the 
presence of the fourth and fifth industries and would remain at 40 per cent, 
however, since its value depends only on those amplitudes which differ from 
zero. 

A “size weighted” diffusion index is also easily formed from the components 
of an homogeneous aggregate. Total production worker employment, for ex- 
ample, is the sum of employment in a number of industries, large and small. 
In order to make a large industry count more than a small one, it is only 
necessary to total the employees of all industries experiencing increases and to 
express that figure as a percentage of total employment in all industries. Thus 
if industry A employs 500 people in January and 600 in February, while in- 
dustry B shows corresponding totals of 400 and 300, and industry C of 100 
and 50, the size weighted index for February would be 500 divided by the sum 
of 500, 400, and 100, or 50 per cent. The unweighted index for the same situa- 
tion would again be 33 per cent, of course. 

Several points about the size weighted ‘ndex require clarification. First, in 
this particular example the measure of industry size varies from month to 
month. Hence, it is necessary to choose either the January or the February 
employment as the weight for the expansion or contraction which occurs be- 
tween January and February. The initial month has been chosen arbitrarily as 
the weight-base month, although the diffusion index itself has been located in 
the terminal month. The reason for this choice will become apparent a little 
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later. Second, if all industries in the group experience either increases or de- 
creases, the size weighted index will differ from an unweighted index whenever 
the average size of the expanding industries does not equal the average size of 
the contracting industries. Third, it is necessary, however, to take account of 
any industries for which neither increases nor decreases occur in a given month, 
since the denominator of the index includes employment in all industries in the 
sample. This may be done quite simply by assigning one-half of the employ- 
ment in the no-change industries to the expanding group, but this may cause 
the size weighted index to differ from the unweighted index even when the 
average size of expanding and contracting industries is the same. 

A weighted diffusion index which takes into account both relative size and 
relative amplitude may also be computed. To illustrate again with the employ- 
ment data, the absolute change for a given industry in a given month will 
depend upon both the size of the industry and the relative amplitude of the 
change from the preceding month. In the case of industry A cited above, for 
example, the increase in employment of 100 from January to February was the 
product of an initial employment of 500 and a proportional increase of 20 
per cent. (It is because of this relationship that the amount of employment in 
January was used in the size weighted index corresponding to the unweighted 
and amplitude weighted indexes for February.) Since the absolute change in 
each industry summarizes both the amplitude and size effects, a “full weighted”’ 
index may be calculated by forming the ratio of the sum of absolute changes 
for expanding industries to the sum of absolute changes for both expanding 
and contracting industries, with the sign of the latter disregarded. Using the 
data for industries A, B, and C, the index would be 100 divided by 250 (100 
plus 100 plus 50), or 40 per cent. The full weighted index will diverge from the 
unweighted index if the average amount of absolute change in expanding and 
contracting industries is unequal or if the unweighted index has been damped 
by one or more no-change industries. 

Two final technical points should be noted before we turn to an examination 
of the weighted diffusion indexes of employment which have been calculated 
according to the foregoing plan. First, although the initial amount of employ- 
ment when multiplied by the percentage change yields the absolute change in 
employment for any given industry, no such multiplicative relationship holds 
between the corresponding size, amplitude and full weighted diffusion indexes. 
Second, analogous indexes of diffusion can be constructed from the components 
of weighted-aggregate index numbers, with minor differences in details. Thus, 
the author constructed weighted diffusion indexes from the Federal Reserve 
Index of Industrial Production, although they are not presented here because 
they differed only slightly from the employment indexes in either time-shape 
or inter-relationships.°® 





* Unlike the employment case, the measure of size in the production index is a constant for each industry; 
namely, the weight assigned to it in the production index base period. Instead of the percentage amplitudes, the 
absolute changes in the production indexes for individual industries were used as weights iu the amplitude weighted 
diffusion index. For an individual industry, the base weight multiplied by the absolute change in its production 
index yields the change in the points contributed by the industry to the total index of industrial production. The 
changes in points contributed measure the joint effects of size and amplitude, and therefore provide the data for 
the full weighted diffusion index, which is computed as the ratio of the sum of increases in points to the sum of both 
increases and decreases in points, disregarding the sign of the latter. 
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Now for a look at the set of employment diffusion indexes. All four variants 
are graphed in Chart 42. The first thing that we notice is how alike they are. 
Whether unweighted, or adjusted for size, amplitude, or combined size-ampli- 
tude effects, the indexes are notably consistent with regard to month-to-month 
changes of direction, short-term trends of increase or decrease, and dates at 
which the 50 per cent level is broached from above or below. If there are any 
readers who have been uncomfortable about the lack of explicit weighting in 
the bulk of the published diffusion indexes, they will be reassured to know that 
it makes little difference as far as timing behavior is concerned. At least, this is 
true in the present instance, and also in the case of diffusion indexes based upon 
industrial production and computed for the same time period. 

One systematic difference does exist, however, and it appears between two 
pairs of indexes. The amplitudes of the full weighted and amplitude weighted 
indexes are larger than those of their unweighted and size weighted counter- 
parts, although the divergency is less observable at the extremes which mark 
off close approaches to 100 per cent or to zero than in the irregular fluctuations 
which fall between. To make much the same observation in another way, the 
size weighted index is virtually a twin of the unweighted one, while the full 
weighted and amplitude weighted versions also bear a strong resemblance. It 
follows that the divergencies between the unweighted and full weighted in- 
dexes are due primarily to imbalances in the average amplitudes rather than in 
the average sizes of the expanding and contracting industries. This would be 
unequivocally true were it not for the damping effect of no-change industries 
on the unweighted and size weighted indexes, but that it is valid in any event 
will be shown later. 

In the present instance, then, it may be concluded that a diffusion index 
weighted by relative size of the components is scarcely distinguishable from an 
unweighted (equally weighted) index. Can this result be generalized? It was 
true also of another homogeneous aggregate, industrial production, and there 
is reason to believe that it is true more generally still. As measured by produc- 
tion or employment, the average size of expanding manufacturing industries 
tends to be constant and equal to the average size of the contracting industries 
despite frequent shifts in number and composition of the two groups, and this 
is why the size weighted and unweighted indexes are so similar. The hetero- 
geneous diffusion indexes are assembled from economic variables which are 
related to the processes of production and distribution, and to the associated 
financial flows. It seems implausible that the “relative importance” of the eco- 
nomic processes related to production in given industries should differ signifi- 
cantly from the relative importance of those industries in the aggregate of pro- 
duction. Such a difference might exist if industry size was strongly correlated 
with the nature of the product—whether it were a producer or consumer good, 
its durability, ete—but this is not so.'° If the differenze does not exist, the 
tendency for relative industry size to be neutral should be paralleled by a 





10 If industry size were correlated with those attributes of the product which are important determinants of its 
cyclical behavior, however, it is probable that the size effect would not be neutral with regard to the behavior of the 
employment or production diffusion indexes themselves. In that case, weighting for “relative importance” would 
make a difference for both the homogeneous production and employment indexes and the related heterogeneous 
indexes. 
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tendency toward neutrality of the relative importance of the components of 
related economic variables, and heterogeneous indexes of comparable breadth 
should be little affected by the absence of size weighting. 

Amplitude weighting (or its near equivalent, full weighting) is not quite so 
easily dismissed on empirical grounds, for it does impart a new look to the 
diffusion index. It is a method of weighting, moreover, that would be practical 
with heterogeneous indexes, since percentage amplitudes of the components 
could be calculated (except for series with negative values) even when no ob- 
vious means of size weighting was at hand. Nevertheless, to judge from the 
experimental indexes, the new look arises largely from an exaggeration of the 
month-to-month movements of the unweighted index, rather than from any 
significant alteration of the short-term trends or cyclical highs and lows shown 
by the latter. The amplitude effect, in other words, is positively correlated with 
the extent of diffusion, and the correlation is close enough and linear enough 
so that given degrees of diffusion are magnified by the effect without being 
altered in rank. If any significant advantage is to result from amplitude weight- 
ing, it would probably be found in this property, and would presumably have 
to do with the bearing of diffusion on the amplitude of business cycles rather 
than the timing of upturns or downturns. 

The monthly rate of change of aggregate employment may now be compared 
with the diffusion indexes. A close correspondence will be noted between the 
rate of change and the unweighted diffusion index except at those times when 
strikes cause sharp fluctuations in the former, as in the fall of 1949 and the 
summers of 1952 and 1956. Since strikes cause substantial curtailments in a 
restricted number of affected industries but leave other industries largely un- 
touched, total employment and production may drop sharply and rebound 
strongly even though the proportion of expanding industries changes compara- 
tively little. The strikes are reflected in the amplitude and full weighted in- 
dexes, however, for these do take account of relative amplitudes. 

In fact, in so far as the arithmetic is concerned, the full weighted index is 
closely related to the absolute rate of change of the aggregate. The monthly 
increase or decrease of employment may be calculated either from total em- 
ployment in two consecutive months or as the difference between the sum of 
all individual increases and that of all decreases between the two months, and 
it is the latter information which is used in the full weighted index. It must be 
stressed, however, that the mere fact that both measures may be derived from 
the same data does not in itself make them equivalent. An increase of 100 in 
total employment would be compatible with a positive change of 100 and a 
negative change of zero, or with a positive change of 1000 and a negative of 900. 
In the first case the full weighted index would be 10€ per cent, but in the second 
it would approach 50 per cent. Similarly, there is no arithmetical reason why a 

onstant absolute rate of change over time could not be compatible with a ris- 
ing, falling, fluctuating or constant full weighted diffusion index. Just as in the 
instance of the inter-relations among the diffusion indexes themselves, the close 
correlation between the rate of change of the aggregate and the full weighted 
diffusion index is due to certain systematic tendencies in the cyclical relation- 
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ships among the industries which compose the aggregate, and at one remove, to 
the underlying causes of those tendencies. 

The tendencies may be readily demonstrated with the aid of Chart 48. This 
chart shows the positive and negative components which underlie the dif- 
fusion indexes and rate of change curve of Chart 42. It will serve to clarify the 
relationships among the indexes and to support some earlier assertions. 

The top panel of the chart records the number of expanding and contracting 
industries, with the former measured upward and the latter downward from 
the zero line. The two curves rise and fall together, as indeed they must. With 
a fixed total of industries, the number rising can increase only if there is a de- 
crease in the number of falling or no-change industries, and since the latter tend 
to be a small minority at any time, most of the offsetting changes occur in the 
expanding and contracting groups. This fact shows up graphically in the tend- 
ency of the vertical distance between the curves to remain constant. 

Is there a similar constraint on the relationship between the two curves of 
the second panel, which show the total month-to-month changes in employment 
respectively in the expanding and contracting groups? If there were a fixed 
labor force of factory workers, and if manufacturing industries were constantly 
tending to full utilization of that labor force, any increase of employment in one 
or more industries would have to be offset by an equal reduction in some other 
industry or industries. Each curve would then always be the same distance from 
the zero line as the other, but the gap between them would widen or narrow 
according to the magnitude of the shift in the industrial composition of em- 
ployment. One elementary meaning of the fact that the curves move up and 
down together, then, is that fluctuations occur in the total volume of employ- 
ment. Another, less obvious, meaning is that the a priori possibility that large 
opposing movements of the expanding and contracting groups may give rise to 
small changes in the aggregate seldom occurred in the actual event during the 
postwar period. When total employment was rising rapidly, “spot declines” 
in employment were small; when the total was falling rapidly spot increases 
were negligible; and when the total was steady, offsetting movements were 
small and stable rather than large and variable. An increase in the absolute 
rate of growth of the expanding group, moreover, was usually accompanied by 
a diminished rate of decline of the contracting group, and vice versa. The major 
exceptions were during strikes, and at periods of peak “cyclical” rates of change 
in the aggregate, when the rate of change of the group currently in the minority 
hit its floor of zero. 

The significance of these last observations merits discussion, but it will be 
helpful to glance first at the remaining panels of Chart 48. The average absolute 
increase of employment in the expanding industries is graphed in the middle 
panel together with the average absolute decrease in the contracting industries. 
The reader will observe that these curves display the same general contours 
as their counterparts in the upper panels, although the cycles are damped and 
are frequently obscured by erratic fluctuations. Here is clear evidence that the 
tendency for wider diffusion to raise the aggregate is reinforced by a tendency 
for expanding industries to expand more and contracting industries to contract 
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Number of industries expanding (upper) or contracting (lower). 



























































Vertical lines indicate business cycle peaks and troughs as dated by the National 
Bureau of Economic Research. 


Cuart 48. Characteristics of expanding and contracting industries which affect the dif- 
fusion indexes of employment of production workers in manufacturing. 


less as the extent of diffusion increases. The fact that the unweighted index has 
a smaller amplitude than the full weighted one, then, is not due simply to the 
damping effect of no-change industries. Finally, inspection of the two lower 
panels of the chart shows that the cyclical pattern of the average absolute 
changes is largely the result of systematic variations in the average amplitudes 
of the expanding and contracting groups rather than of changes in the average 
sizes of the groups. The infrequent sharp changes in average size occur almost 
entirely during extreme periods when the expanding or contracting groups 
dwindle to one or two industries which may happen to be small or large. 

It is now quite clear in a statistical sense why the rate of change of the ag- 
gregate is positively correlated with the degree of diffusion. It is because the 
causes which make for an increase of diffusion also tend both to amplify the 
average increases in rising industries and to dampen the average declines in 
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falling industries, while leaving relatively unaffected the average size of the 
industries in the two groups. At the same time, during the period under dis- 
cussion, the underlying causes circumscribed the amplitude effect sufficiently 
to preserve a fairly consistent rank relationship between degree of diffusion and 
rate of change of the aggregate, which broke down mainly during and im- 
mediately after important strikes. 

In this connection, it isimportant not to misunderstand Broida’s observation 
that “In terms of a particular aggregate and its components, a correlation of 
both the degree of cyclical diffusion and amplitudes in individual series with 
the amplitude in the total is tautological; the larger the number of series that 
are contracting, and the lower they go, the more the total declines.” He does 
not mean to imply a tautological positive correlation between degree of dif- 
fusion and average amplitude of individual series, for he is emphasizing the 
independent influence of the latter in the passage. It is certainly conceivable 
a priori that a contraction could be mild either because all activities declined, 
but mildly, or because severer declines occurred, but were not as widely dif- 
fused. The positive empirical correlation which has been discovered by the 
National Bureau between degree of diffusion and amplitude of fluctuation in 
aggregates implies at a minimum, however, that the amplitude effect does not 
systematically neutralize the diffusion effect. Indeed, the results of the present 
experiment suggest that the two effects are usually mutually reinforcing. 

It was suggested earlier that any advantage of amplitude weighted over un- 
weighted diffusion indexes would probably have to be found in a superior rela- 
tionship of the former to the amplitude of fluctuations in the aggregate. It is 
now apparent that, paradoxically enough, it may be preferable to use un- 
weighted indexes to study the correlation between cyclical diffusion and over- 
all cyclical amplitudes. Since amplitude (or full) weighted indexes have wider 
amplitudes than unweighted ones, they may easily approach their limits of 
zero or 100 per cent even during mild movements of the aggregate (see Chart 
42). This attribute may give them less ability than unweighted indexes to dis- 
criminate among amplitudes of cyclical fluctuations. In fact, once the rate of 
change of the minority group has approximated zero, as it did in the present 
example during the recessions of 1948-49 and 1953-54 and during periods of 
expansion in 1950, 1952-53 and 1955 (second panel of Chart 48), further in- 
creases in the rate of change of the majority group cannot be reflected at all in 
the value of the weighted diffusion index, which has already reached its limit. 
If the combined effects of diffusion and amplitude of individual activities are 
to be observed, it would probably be preferable to use the data in the form 
shown in Chart 48, with the expanding and contracting groups treated sepa- 
rately. This could be done even for heterogeneous collections of series, of course, 
by plotting the percentage amplitudes respectively of the expanding and con- 
tracting groups. 

I have stressed the distinction between statistical correlations and under- 
lying causes at several places in this paper. The purpose of the paper has been 
to report on an experiment which was undertaken to see what differences, if 





u Broida, loc. cit. 
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any, weighting for amplitude or size might make in the behavior of diffusion 
indexes. The implications of the findings for the larger subject of the causes 
of business cycles is another story entirely. It is not inappropriate, however, to 
close with a few suggestions about the use of weighted or unweighted diffusion 
indexes in the study of causation. 

The phenomenon of diffusion suggests, of course, that disaggregation may 
be important in the analysis of business cycles. To quote A. F. Burns, were 
. . everything rising and falling in unison, there would be little need to fuss with 
specific factors in business, and one might center attention exclusively on aggregate 
activity. But business cycles—that is, the cycles in aggregate activity encountered 
in historical experience—are of a very different character. They are marked by ex- 
pansions and contractions that are only partially diffused through the economy, and 
it is therefore of the utmost importance to obtain as clear a notion as we can of how 

the specific cycles of different activities are tied together.” 


If anything, the results of the experiment in weighting would strengthen this 
conclusion, for we have discovered that making allowance for two kinds of 
relative importance of individual activities—size and amplitude—does not 
alter the systematic relationships which have been found to hold between 
cyclical diffusion and cyclical fluctuations of aggregate activity. 

It should be emphasized, however, that these findings do not mean that all 
industries and all activities are perforce equally important in the cyclical proc- 
ess. Some disaggregation is necessary and desirable, but how much and of what 
kind cannot be settled by summary measures of diffusion. Particularly in con- 
nection with categories of final demand, it may be both useful from the stand- 
point of manageable theories and relevant from the viewpoint of actual ex- 
perience to deal with sizeable aggregates. An increase or decrease in residential 
construction, for example, may cut across many localities and affect activity 
in many ancillary industries, and because of lags the process may be diffused 
in time as well, but it may still be worthwhile to define and study the aggregate 
as an aggregate. Given the geographic dispersion of economic activity and the 
absence of complete vertical integration in many lines of production and dis- 
tribution, fluctuations in a comparatively few “important” categories of final 
demand may be quite consistent with widespread diffusion among numerous 
industries and the manifold contractural and financial flows which bind them 
together; and yet it may be possible to explain the fluctuations in the aggregates 
without reference to their own internal composition. 

It is also quite possible, however, that disaggregation at a fairly detailed 
level may be necessary for an adequate understanding of business cycles. The 
correlation between the diffusion and amplitude effects may be the result, for 
example, of the interaction of secular and cyclical forces as they impinge on the 
various industries. Whatever the secular trend of output in an individual in- 
dustry, its rate of advance should tend to quicken (or its rate of decline to 
diminish) during general cyclical expansions and to diminish (or accelerate) 
during contractions, so long as it is subject to a positive income elasticity of 
demand which is not outweighed by possible adverse effects of cyclical changes 
in relative prices. If we ignore irregular fluctuations and think of industries as 





2 Burns, op. cit., pp. 6-7. 
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subject only to growth trends as modified by cyclical factors, it makes economic 
sense that the positive correlation between degree of diffusion and rate of 
change of the aggregate is paralleled by a positive correlation between the 
latter and the rates of change of the individual industries. It makes sense if for 
no other reason than that an increase of aggregate demand will tend both to 
stimulate more industries and to stimulate each industry more, and the larger 
the increase, the larger the stimulus. If now, in addition, the rate of change of 
the aggregate could be shown to depend upon diffusion in the sense that eco- 
nomic activity tends to be buoyant and resistant to contractions when a sizable 
proportion of individual activities are in states of vigorous secular expansion, 
and vice versa, then the extent of diffusion would become in some degree an 
autonomous variable in the cyclical process. Complete understanding of busi- 
ness cycles would require not only that diffusion be regarded as a symptom of 
the transmission of cyclical impulses from a few major categories of final de- 
mand to numerous prior stages of production, but also that individual indus- 
tries be studied in order to isolate the influence of such growth factors as tech- 
nology, population increase, and market saturation on their individual fortunes 
and hence upon the fortunes of the entire economy. 

Detailed industry studies would presumably be advisable in the foregoing 
connection, but diffusion indexes would also be helpful. As an illustration, 
separate indexes might be constructed for groups of industries or products ex- 
periencing respectively large, average, small, and negative rates of expansion 
over periods of time bridging several cycles and thought to be homogeneous 
with regard to general structural or secular characteristics. It seems likely that 
the first group would display a high average level of diffusion even during 
general business contractions, unless the latter were unusually severe. An 
opposite tendency toward a low level even during expansions should be found 
for the group of declining industries, while the intermediate grcups would 
probably show an average level of diffusion over the cycle of about 50 per 
cent.!? Demonstration of such relationships would support the hypothesis that 
mild cyclical movements are mild because they are not widely diffused, rather 
than its converse that they are not widely diffused because they are mild. 
This is because it would show that forces partly independent of the current 





% Correlated characteristics would probably be found to hold with regard to amplitudes. The average monthly 
rate of change should rise and fall with the business cycle, but for the group of rapidly growing industries it should 
tend to be positive even during contractions (or if negative, to be small absolutely and relative to the rate of increase 
achieved during expansions). Conversely, the secularly declining industries should tend toward reduced rates of 
decline or perhaps even mild increases during cyclical expansions. It is important to notice that neither these am- 
plitude characteristics, nor those about diffusion postulated in the text, are tautological results of the system of 
classification used to group the industries. A rapidly growing industry must have a high rate of growth on the 
average over the entire period, but this could be consistent with several cyclical patterns: (a) large and persistent 
negative changes during contractions, which were more than offset by larger positive changes during expansions, 
(b) negative changes during contractions which did not persist throughout the phase and which were smal! absolutely 
and relative to rates of increase during expansions, (c) persistent advances even during contractions, but at slower 
rates than in expansions, (d) persistent advances at approximately equal rates during expansions and contractions 
alike, (e) faster growth during contractions than in expansions. These various patterns could result from different 
combinations of growth curves and cyclical factors, and all would be consistent with rapid secular growth of the 
industry. Pattern (a), however, would not serve to moderate general business contractions. Cases (b) and (c) are 
considered to be typical of the rapidly growing industries, and underlie the postulated group characteristics. Pattern 
(d) is a limiting case of (c) which would yield the same result as the latter as far as diffusion is concerned but would 
not introduce any cyclical variation into the average rate of change for the group. The last pattern is not likely to 
be found often in the real world. 
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level or rate of change of aggregate activity may foster offsetting movements 
in particular sectors of the economy which persist systematically over several 
cycles and are therefore not to be explained by purely cyclical factors. The 
explanation of differences in the severity of business cycles would then have to 
be sought in those cyclically autonomous forces or structural differences which 
fo tered such independent movements during various historical periods. 

With regard to the correlation between rate of change of the aggregate and 
the degree of diffusion, Moore has pointed out that the latter yields information 
which is relevant to the application and testing of the acceleration principle, 
which 

. .. has ordinarily been formulated and tested in terms of the rate of change in ag- 
gregate output. For it seems likely that, in so far as business firms vary their invest- 
ment with the rate of change in output, it is the rate of change in their own output, 
not that in aggregate output, to which they react. And since the reaction to a decrease 
in output cannot, in general, be equal and opposite to the reaction to an increase in 
output, it is important to know something about the distribution of increases and 
decreases among firms. A diffusion index may not provide all the required information, 
but it does supplement significantly the information provided by the rate cf change 
in aggregate output." 


In much the same way, the data underlying weighted diffusion indexes of pro- 
duction can be used to supplement the unweighted indexes, by providing in- 
formation about the average rates of change of the industries currently in 
states of expansion or contraction. 

It would be best to study the expanding and contracting groups separately. 


The average rate of decrease of those industries experiencing declines may not 
be especially relevant in so far as the behavior of fixed investment is concerned, 
since downward adjustments of capacity are difficult to achieve in the short- 
run and excess capacity may readily develop as output falls. The rate of de- 
crease may be important in connection with inventory investment, however, 
for such investment may easily become negative and may therefore parallel 
even short term changes in the rate of decline of output. Incidentally, the third 
panel of Chart 48 suggests that the average rate of increase of employment per 
expanding industry tends to diminish when the number of expanding industries 
diminishes. An analogous relationship appears between the number of declining 
industries and the average rate of decline per industry. In view of the strong 
correlation between short-run changes in production and employment, it might 
be supposed that these same tendencies would be found in the data on indus- 
trial production, and this is indeed the case. The important implication as re- 
gards the acceleration principle, of course, is that a decline in the rate of growth 
of the aggregate is usually a symptom not alone of the fact that the proportion 
of declining industries has increased, but also that the still expanding industries 
are expanding more slowly on the average. 

One final point. Moore comments that “. . . our studies of diffusion indexes 
support the broad statement that the scope of a business cycle expansion or 
contraction invariably diminishes toward the end of the expansion or contrac- 





™ Moore, op. cit., p. 14. 
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tion. In our experience the factual evidence would not warrant a similar state- 
ment regarding rates of change in most economic aggregates; the evidence, 
particularly for expansions, is far more equivocal and conflicting.” This lack 
of agreement is rather surprising, given the close correlation between diffusion 
and rate of change of factory employment and industrial production during 
1947-56. It would be interesting to subject diffusion indexes based upon month- 
to-month changes to the standard NBER cycle anaiysis in order to establish 
whether the jack of parallelism may be partly due to the fact that intraphase 
irregularities are smoothed out in many of the historical diffusion indexes, but 
may be reflected in the measures of rates of change of aggregates during cyclical 
phases. 





% Moore, loc. cit. 








A STATISTICAL ANALYSIS OF PROVISIONAL ESTIMATES OF 
GROSS NATIONAL PRODUCT AND ITS COMPONENTS, OF 
SELECTED NATIONAL INCOME COMPONENTS, AND 
OF PERSONAL SAVING* 


ARNOLD ZELLNER 
University of Washington 


The earliest preliminary or provisional estimates of sixteen items in 
the national accounts of the United States have been compared with 
the most thoroughly revised estimates currently available for the period, 
1947 II-1955 IV. For corresponding revised and provisional estimates, 
average absolute differences in level and in amount of quarterly change 
have been calculated. Also the frequency of disagreement on direction 
of quarterly change has been determined. The results of these calcula- 
tions have been reviewed in the contexts of two postwar business cycles 
and with reference to the methods of estimation employed by the 
Department of Commerce. 


I. INTRODUCTION 


ANY private and public economic policy decisions and forecasts are 

formulated or constructed on the basis of preliminary or provisional esti- 
mates of the magnitudes and/or amounts and directions of change of economic 
variables in our national accounts. That preliminary or provisional information 
is the only kind available upon which to base many decisions and forecasts is 
the result of what are inevitable delays in obtaining and processing the data 
needed for making final or completely revised estimates.! At the present time, 
the most current estimates of quarterly GNP and its components, as well as 
estimates of most quarterly components of national income and of personal 
saving, are in circulation in unrevised form until revisions appear in the July 
issues of the Survey of Current Business.* Thus, provisional estimates of these 
important economic variables constitute the only information available under 
current procedures for as long as a maximum of eleven months and a minimum 
of two months.’ Further, it has been the case in the past that the revised esti- 
mates appearing in the July issues of the SCB have been subject to further 
revision. 





* The author wishes to thank the referees for helpful suggestions and criticisms. 

1 For example, the July 1956 Survey of Current Business, p. 6, explains that the revisions reported therein were 
“... occasioned by incorporation of Internal Revenue Service tabulations of corporate and individual income tax 
returns for 1953.” Also, “ ... revisions in the 1955 and 1956 figures reflect incorporation of data for 1955 which 
become available only on an annual basis and hence could not be utilized in preparing the initial estimates for that 


period.” 
? Occasionally revisions of certain estimates appear before the revisions contained in the July issues of the 
SCB. In a personal c ication (September 24, 1957) the author was informed by the Office of Business Eco- 





nomics that, “Other thaa the published quarterly estimates themselves we have no materials relevant to the nature 
and size of the revisions.” 

* Second quarter estimates which appear for the first time in the August issues of the SCB are revised in July 
issues of the following year, while first quarter estimates, first appearing in the May issues, are revised in July issues 
of the same year. The July issues generally contain revisions of the data for the first quarter of the year of the July 
issue and for the last three quarters of the previous year. 

* See National Income, A Supplement to the Survey of Current Business (1951 and 1954 editions). 
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Since provisional estimates constitute one of the bases upon which predic- 
tions are constructed and policies formulated, it is deemed important to study 
the statistical properties of past provisional estimates. The results of such study 
should be of interest not only to those in goverment who are charged with the 
responsibility of constructing reliable estimates, but also to policy-makers, to 
forecasters, and to those concerned with the origins and analysis of past de- 
cisions and of errors in past forecasts.5 

Provisional quarterly series, seasonally adjusted® at annual rates, for gross 
national product and its components, for selected components of national in- 
come, and for personal saving have been assembled from the earliest estimates 
published in the SCB for the period 1947 II-1955 IV. These series are com- 
pared below with the most completely revised corresponding estimates now 
available.’ 

The analysis is concerned with three kinds of error which may exist in the 
provisional estimates: (1) error in estimating the quarterly values of selected 
items in the national accounts, (2) error in estimating the amounts of change 
in these items from one quarter to the next, and (3) error in estimating direction 
of quarterly change.* For ecnvenience these errors will be referred to as (a) 
“error in the level,” (b) “error in the amount of change,” and (c) “error in 
direction of change.” Error in the level of a provisional estimate of a particular 
item for a particular quarter, say the ith, is assumed given by the absolute 
difference between the provisional estimate (P;) and the revised estimate (R,) 
of that item for the ith quarter, that is, e;=|P;—R,|. Error in the amount of 
change shown by a provisional estimate for the ith quarter is taken to be 
| P;-—Pin)- (Ri—R,)| . Error in showing direction of change has been defined 
to include the following cases: (a) P;—P;. and R;—R; of opposite sign, (b) 
P;—P;1=0 and R;—R;1+0, and (c) P;—P;1+0 and R;—R;1=0. 

Not only the number of times that each provisional series errs in showing 
direction of change has been recorded, but also the particular quarters in which 
errors occur. In what follows this information, as well as that on errors in level 
and amount of change, is reviewed in the contexts of the 1948-1949 and 
1953-1954 business cycles. A final section contains the conclusions of this 
study. 

II. ERRORS IN THE LEVEL OF PROVISIONAL ESTIMATES 


In Figure 56 deviations of provisional estimates from revised estimates for 16 
items appearing in the national accounts have been plotted for the period 1947 
II—1955 IV. Several of the most striking features brought out by Figure 56 





5 In study of the origins of past policy decisions and of past forecasting errors, it is obviously necessary to 
consider the information at the disposal of policy makers and forecasters at the particular times when these in- 
dividuals performed their functions. 

* The Department of Commerce's seasonally unadjusted quarterly data “ ... are not intended for analytical 
purposes; they are neither completely unadjusted nor completely adjusted for seasonal factors but rather a hybrid.” 
I. Friend with V. Natrella, Individuals’ Saving (New York: John Wiley & Sons, Inc., 1954), p. 45. See also, A. J. 
Gartaganis and A. 8. Goldberger, “A Note on the Statistical Discrepancy in the National Aceounts,” Econometrica, 
23 (April, 1955), 166-73, p. 171. 

? These were obtained from National Income, A Supplement to the Survey of Current Busi (1954 edition) 
for the period 1947 II-1951 IV and from the Survey of Current Business (July, 1956) for the period 1952 I-1955 IV. 

* Errors arising when provisional and revised estimates are linked, as is sometimes done without adequate 
footnoting in Economic Indicators (prepared for the Joint E ic C ittee by the Council of Economie Ad- 
visers) are not considered. 
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Provisional Estimates Minus Revised Estimates by Quarters, 1947 Il - 1955 1V 
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are: (a) provisional estimates of personal consumption expenditures have been 
lower than the revised estimates for all quarters except two, (b) provisional 
estimates of personal consumption expenditures on services have been con- 
sistently low, (c) provisional estimates of personal consumption expenditures 
on nondurables have been high for most of the period under consideration, (d) 
provisional estimates of producers’ durable equipment expenditures have been 
high except for five quarters, (e) provisional estimates of new construction have 
generally tended to be low, and (f) provisional estimates of personal disposable 
income and net interest income have tended to be low since about the middle 
of 1950. Also, it will be noted that for several items (consumer nondurables, 
personal saving, personal disposable income, and proprietors’ and rental in- 
come) the provisional estimates exceeded the revised estimates most markedly 
‘in the quarters directly following 1947 IT.° 

The mean and standard deviation of the absolute errors associated with each 
of the items shown in Figure 56 are presented in Table 58. The mean absolute 
errors range in size from $4.2 billion (all figures are seasonally adjusted at annual 
rates) for personal consumption expenditures on services to $0.6 billion for 
personal consumption expenditures on durables. Percentagewise, the largest 
errors are encountered for change in business inventories (71.9 per cent), net 
foreign investment (159.9 per cent), personal saving (22.2 per cent), and net 
interest income (10.9 per cent), all items estimated by the residual method, 
that is as the difference of two generally much larger totals. Change in business 
inventories is obtained as the difference between estimates of total inventories 
at two points in time, net foreign investment as the difference between gross 
outflows and gross inflows, personal saving as the difference between personal 
disposable income and personal consumption expenditures, and net interest in- 
come as the difference between payments and receipts of relevant payer 
groups.?° It is well known that the Department of Commerce's residual type 
estimates are subject to large percentage errors.'! In the present context, even 
small percentage revisions of the totals underlying the residual estimates might 
result in large relative revisions in the residual estimates and in large percentage 
errors (as defined herein) in these items. 

Four other items showing sizable percentage errors are producers’ durable 
equipment expenditures (10.6 per cent), new construction expenditures (6.0 per 
cent), expenditures on consumer services (5.8 per cent), and proprietors’ and 
rental income (5.5 per cent). Perhaps the magnitudes of these errors are ex- 
plained by the fact that heavy reliance is placed on periodic and sometimes 
rather infrequent source material in estimation of these items. For example, 
the estimates of consumer expenditures on services rest heavily on benchmarks 
derived from the Censuses of Population and Business. In estimation of expen- 
ditures on producers’ durable equipment extrapolations are made pending the 
availability of the Census Bureau’s Annual Survey of Manufactures. Further, 





* This may reflect the fact that the earlier estimates are more thoroughly revised than are the later estimates, 

10 For a more detailed description of methods of estimation see National Income (1954 edition), 135-38, 139-43, 
159, and 97-103, and Business Statistics, A Supplement to the Survey of Current Business (1955 edition), 193-97. 

In National Income (1954 edition), the following statement, referring to personal saving estimates, appears: 
“As the difference of these much larger totals, it is subject to large percentage errors in level as well as in move- 
ment... ” (p. 150). Similar statements appear in sections describing other items estimated by the residual method. 
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TABLE 58 


ERRORS IN THE LEVEL OF PROVISIONAL ESTIMATES, 
1947 II-1955 IV 




















Mean Absolute Standard Mean Absolute 
Item Deviation* Deviation Percentage 
é Ce Deviation” 
(billions of dollars, seas. adj. 
(per cent) 
annual rates) 

Gross national product 3.3 2.4 1.1 
Personal consumption exp. 2.6 1.4 RK 
Durable goods 0.6 0.4 2.2 
Nondurable goods 2.1 2.1 1.9 
Services 4.2 1.6 5.8 
Gross priv. dom. invest. 2.7 1.6 6.7 
New construction 1.4 1.4 6.0 
Producers’ durable equip. 2.3 1.6 10.6 
Change in bus. inventories 1.9 1.7 71.9 
Net foreign investment 0.8 0.6 159.9 
Government purchases a8 0.9 1.8 
Compensation of employees 2.1 1.6 1.2 
Net interest income 0.8 0.4 10.9 
Proprietors’ and rental income 2.6 2.7 5.5 
Personal disposable income ae 2.3 1.6 
Personal saving 2.5 3.7 22.2 





Sources of data: See text. 
* Average absolute deviation of provisional from revised estimates for each item. 
> Average of absolute percentage deviations defined as follows: Pj = 1002 R;(e/R;)/ DR. 


the estimate of expenditures on business passenger cars, comprising roughly 
ten per cent of producers’ durable equipment expenditures, depends on an 
allocation of expenditures on passenger cars between consumers and business 
“ _.. which is based on limited information.”** Revised estimates of proprie- 
tors’ and rental income incorporate information from income tax returns which 
becomes available only on an annual basis. Finally, the following quotation 
illuminates some of the difficulties involved in estimating expenditures on new 
construction: 
Modern techniques of systematic reporting and sampling are difficult to apply to the 
direct statistical measurement of construction activity. Many of the producers that 
should be covered are hard to identify. Much construction work is done by firms only 
intermittently attached to the industry; many construction firms have no fixed and 
readily recognizable place of business; and any firm in any industry may undertake 
force accounts projects. . . . Under these circumstances, estimation relies heavily on 
indirect evidence of construction activity ...and is subject to considerable uncer- 
tainty as to coverage, valuation and timing." 


The errors in the level of provisional estimates of individual items can hardly 
be considered temporally independent since the provisional estimates are pre- 
pared in essentially the same manner from one quarter to the next and can be 





12 National Income (1954 edition), p. 127. 
4 Ibid., p. 123. 
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Provisional Estimates minus Estimates by Quorters, 1947 Il - 1955 IV 
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Fra. 59. Components of personal consumption expenditures and of gross investment. 


corrected for bias only after bias has arisen.‘ Therefore some of the following 
test results must be interpreted with more than a grain of salt. The Mann- 
Kendall test for trend in the amplitude of the errors showed a highly significant 





4 Application of the von Neumann test reveals highly significant positive autocorrelation of the errors for 14 
of the 16 items shown in Table 58. 








60 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1958 


(99 per cent level) negative trend for six items: consumer expenditures on non- 
durables, consumer expenditures on services, new construction expenditures, 
proprietors’ and rental income, personal disposable income and personal saving. 
These findings must be tempered by the fact that data for the latter part of the 
sample period, 1952-1955, may not be as completely revised as are data for 
the period 1947-1951. Application of the Friedman rank test to determine 
whether the size of the errors is random with respect to quarters of the year 
resulted in a significant result (95 per cent level) for errors in consumer expendi- 
tures on services. For this item, the first quarter values of e tend to be large 
while the second quarter values tend to be small (see Figure 59). 


Ill. ERRORS IN PROVISIONAL ESTIMATES OF AMOUNT AND 
DIRECTION OF QUARTERLY CHANGE 


Quarterly changes in the sixteen items considered above have been calculated 
from provisional estimates and compared with those derived from revised esti- 
mates of corresponding items with the results shown in Table 62. It is seen 
that the average absolute errors in estimates of amount of change range from 
$3.0 billion for gross national product to $0.3 billion for net interest income. 
Percentage comparisons have been made on the following basis: the average 
absolute error in the amount of change has been compared with the average 
absolute quarterly change, the latter derived from the revised estimates, for 
each item (see Table 62, column 3). The largest percentage errors in the absolute 
amount of change are recorded for net interest income (120.1 per cent), proprie- 
tors’ and tental income (115.4 per cent), net foreign investment (101.9 per cent), 
and the producers’ durable equipment expenditures (96.0 per cent). The lowest 
percentage is that for compensation of employees (29.9 per cent). 

The number of quarters in which the direction of change shown by the pro- 
visional estimates differed from that shown by the revised estimates exceeded 
eight for the following items: personal saving (14), net foreign investment (13), 
proprietors’ and rental income (10), and net interest income (9). 

The items with the largest percentage errors in amount of change and the 
largest number of differences in direction of change are all items which were 
noted above as showing the largest percentage errors in level. Again, the poor 
showing exhibited by provisional estimates of these items, as compared with 
that of the other items studied, may be due to the fact that they are estimated 
by the residual method or are items whose estimation depends heavily on 
periodically available benchmark data. 


IV. PROVISIONAL ESTIMATES AND THE BUSINESS CYCLES OF 
1948-1949 AND 1953-1954 


In the 1948-1949 cycle both the revised and provisional estimates of GNP 
assume a peak value in 1948 IV and a trough value in 1949 IV." With regard 
to the level of GNP, it is the case that, except for the peak quarter, the pro- 
visional estimates are consistently lower than the revised estimates (see Figure 





18 Periods of contraction and expansion in the revised GNP series are indicated by shaded and haded areas, 
respectively, in Figure 56. 
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56).!* Differences occurring directly before and after the peak quarter are: $7.3 
billion for 1948 IT, $6.0 billion for 1948 III, and $4.0 billion for 1949 I. As for 
errors in direction of change, it is seen from Table 62 that in 1949 II and in 
1949 III, just before the trough quarter, the provisional estimates show positive 
‘or upward changes in GNP while the revised estimates show negative or down- 
ward changes. That is, the provisional GNP series shows $0.2 billion increases 
in the intervals 1949 I-1949 II and 1949 II—1949 III while the revised GNP 
series shows decreases of $2.7 and $0.7 billion for these two intervals, respec- 
tively. Differences in the amount of change shown by the two series in the 
neighborhood of the upper turning point are $6.9 billion in 1948 IV and $4.9 
billion in 1949 I with the provisional change being larger than the revised in 
the first instance and smaller in the second. Thus, consideration of the be- 
havior of the provisional GNP estimates in the period 1947 II—-1950 II shows 
that (a) they tend to be low over the entire period except in 1948 IV, (b) they 
give a slight incorrect indication of an upturn in 1949 II and a continuation of 
that upturn in 1949 ITI, and (c) they show a spuriously large decrease between 
1948 IV and 1949 I. 

The spurious upturn of the provisional GNP series in 1949 II can be traced 
to errors in provisional estimates of the amounts of change in gross private 
domestic investment, in particular new construction, and in government 
purchases. The error in showing amount of change in GNP just before the 
upper turning point arises from overstatement by the provisional estimates of 
the amounts of change in all major components with the exception of net 
foreign investment. All provisional estimates of minor components, with the 
exception of that of consumer expenditures on durables, show a similar error. 
The overstatement of the decrease in GNP between 1948 IV and 1949 I by the 
provisional estimates reflects mainly overstatement of the decreases in personal 
consumption expenditures and in government purchases. 

With regard to components of national income and personal saving, it is 
found that provisional estimates of personal disposable income, proprietors’ 
and rental income, and personal saving are consistently high from 1947 II 
through 1950 I (see Figure 63). Errors in the provisional estimates in showing 
direction of change are frequent (see Table 62). The provisional estimates of 
compensation of employees, however, are the only ones which show an incorrect 
early upturn. 

In the 1953-1954 business cycle, the provisional and revised GNP series both 
reach a peak in 1953 II; however, the provisional series assumes its trough 
value in 1954 III whereas the revised series shows a trough in 1953 IV.!7 Thus, 
in contrast to the slight incorrect early upturn shown by the provisional GNP 
estimates in the 1948-1949 cycle, the provisional GNP estimates err in the 
1953-1954 cycle by showing a continued downward movement when in fact, 
according to the revised estimates, a slight upward movement took place. Be- 





% This is mainly accounted for by low provisional estimates of personal consumption expenditures on services 
and of new construction expenditures. 

1” The 1953 IV revised GNP estimate, $357.5 billion, is only $0.1 billion less than the 1954 I revised estimate. 
This small difference gives rise to some uncertainty concerning the dating of the trough when possible errors in the 
revised estimates are considered. The 1954 II revised estimate is $358.5 billion. 
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TABLE 62 


ERRORS IN THE AMOUNT AND DIRECTION OF QUARTERLY CHANGE 
IN PROVISIONAL ESTIMATES OF SELECTED ITEMS IN THE 
NATIONAL ACCOUNTS, 1947 II-1955 IV 














Amount of Change Direction of Change 
Mean a 
s . or. ‘ 
arene yraee: tand-| Per No. 
lute ard cent- of Quarters in which 
, Devia-| age |Differ-| Differences Occur* 
Devia- 





oats tion Error? | ences | 
ion | 


(billions of dol- 
lars, seas. adj. 
ann. rates) 
Gross national product 3.0 | 1.8 48.7 5 49II*, 49I1II*, 541, 

5411, 54111 
| 1.0 39.0 2 | 491II*, 541 
| 0.5 10.1 6 | 47111, 49IV, 50lI, 
| SILL, 5214, 541 

0 | 0.9 | 65.1 3 | 48II, 54I*, 5411 
| 





(per 
cent) 


Personal Consumption exp. 1.4 
Durable goods 0.6 


Nondurable goods 1 
Services 0.6 
2 


0.6 45.9 0 
Gross priv. dom. invest. 7 


8 58.2 47III*, 471V*, 481V*, 
49III*, 511*, 541, 
5411 

New construction 0.5 0.6 61.3 5 491I*, S50IV,! 51I*, 
52111,* 541° 

481e 481V*, 49II, 
501*, 511,' 51II1*, 


— 


Producers’ durable equip. | 0.9 1.0 96.0 


5211 

Change in bus. invent. 2.2 fF 63.6 4 4711I*, 471V, 48iIl, 
541 

Net foreign investment 1.1 0.8 101.9 13 471III, IV*, 48IV, 


49III, 501I*, IV, 
S1I*, 521, S3siv*, 
541° ILe1Il*, 551V* 
59.8 8 | 491, III*, IV*, 50111, 
52111, 53111*, 54111, 
§5I1* 
Compensation of employees , { ff 48II, 49III*, 491V 
Net interest income 0.3 0.3 120.1; 9 481° 49II, III,* IV,° 
501° III,e 51IV,° 
541° 1Ve 
Proprietors’ and rental income 1.3 14 115.4 10 | 48II, IV*, 501I, 52IV*, 
531,4 541, III, 551,! 
II, I1I* 

Personal disposable income ' 50.1 1 5011 

Personal saving 2.0 1.6 66.2 14 471V, 48I*, II, IV*, 
49I*, 511V, 52II, 
IV*, 531, II, 541°, 
IV, 551*, II 
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Sources: See text 














® The absolute: deviation in the amount of change is given by: |(P —P_.) —(R—R_,)|. 
> Average absolute deviation in the amount of change as a percentage of the average absolute change shown by 
the revised series, 
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Provisional Estimates Minus Revised Estimates by Quarters, 1947 Il - 1955 IV 
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© Quarters with an asterisk are quarters in which the provisional estimates indicated a positive change while 
the revised estimates show a negative change. Unmarked quarters are those in which the provisional estimates show 
a negative change while the revised show a positive change. 

4 No change in provisional estimates and negative change in revised estimates. 

© No change in provisional estimates and positive change in revised estimates. 

f No change in revised estimates and positive change in provisional estimates. 

® No change in revised estimates and negative change in provisional estimates. 
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tween 1953 IV and 1954 III the provisional estimates decline $8.0 million while 
the revised estimates rise $1.9 billion. 

At the peak quarter, 1953 II, and for two quarters thereafter the provisional 
estimates are high while during the period in which GNP moved sideways and 
slightly upward, 1953 IV—1954 III,'* they are consistently low. They con- 
tinued to be low in the period in which GNP shows a sharp recovery, 1954 
IV-1955 IV. 

The upward bias in the provisional GNP estimates at the peak and in the 
following two quarters of the contraction phase is in the main attributable to 
high provisional estimates of gross private domestic investment. Of particular 
importance are the errors in provisional estimates of total inventory invest- 
ment and of expenditures on producers’ durable equipment. For example, the 
provisional estimate of inventory investment for 1953 II, $8.8 billion, far ex- 
ceeds the revised estimate, $3.1 billion. For 1953 III and 1953 IV, errors in the 
same direction of over $2.0 billion occur for this item. The provisional estimates 
of expenditures on producers’ durable equipment exceed the revised estimates 
by between $2.0 and $3.0 billion in these quarters. While provisional estimates 
of total personal consumption expenditures are low over this part of the cycle, 
it should be noted that the components of this total behaved differently, non- 
durables being high by about $2.0 billion and services low by about $3 to $4 
billion. 

The lowness of the provisional GNP estimates during 1954 when the revised 
estimates were moving sideways and slightly upward is a reflection of low pro- 
visional estimates of personal consumption expenditures, mainly due to low 
estimates of the service component, and of gross private domestic investment, 
particularly change in business inventories and new construction. Low pro- 
visional estimates of these items and also of government purchases account for 
the lowness of the provisional GNP estimates after 1954 III. 

The error in showing GNP’s direction of change in the neighborhood of the 
cyclical trough resulted from a compounding of similar errors in most major 
and minor components (see Table 62). Most important in accounting for this 
error in 1954 I is the fact that the provisional estimates show decreases in per- 
sonal consumption expenditures and in gross private domestic investment be- 
tween 1953 IV and 1954 I whereas the revised estimates show increases in 
both.'® 

Errors in the provisional estimates of national income components show the 
following behavior over the 1953-1954 cycle. Provisional estimates of personal 
disposable income and of net interest income are low over the entire cycle (with 
the exception of two quarters in the case of personal disposable income). Pro- 
visional estimates of employees’ compensation are lower than the revised 
estimates by between $4 and $5 billion prior to the peak quarter. The provi- 
sional estimates of proprietors’ and rental income and of net interest income 
show errors in direction of change in the vicinity of the cyclical trough. For 





8 The revised GNP estimate for 1954 III is $359.4 billion, only $1.9 billion larger than the 1953 IV trough 
value which is the reason for the use of the phrase “moved sideways.” 

18 Between 1953 IV and 1954 I, the provisional estimates of personal consumption expenditures decrease $0.2 
billion while the revised estimates increase $2.0 billion. For this interval, the provisional estimates of gross invest- 
ment decrease $2.0 billiou while the revised estimates increase $1.3 billion. 
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the first of these two items, the provisional estimates exhibit a downward 
movement in 1954 I and 1954 III whereas the revised estimates show an up- 
ward movement. For the second item, the provisional estimates show no change 
in these two quarters whereas the revised estimates show an upward move- 
ment. The error in direction of change for these two items is seen to be similar 
to that encountered for GNP in the neighborhood of the cyclical trough. 


V. SUMMARY AND CONCLUSIONS 


Provisional quarterly estimates of 16 items in the national accounts have 
been compared with the most thoroughly revised estimates currently available. 
The comparisons made in this study reveal that provisional estimates of items 
estimated by the residual method and of items whose estimation rests heavily 
on infrequently available benchmark data tend to be characterized by larger 
percentage errors in the level and amount of change than are the provisional 
estimates of other items considered. Perhaps in the future the situation with 
respect to these items will be improved since G. Jaszi has written: 

Impressive progress has been made with the sampling surveys which are used in the 
annual, quarterly, and monthly extrapolation of the benchmark figures. Larger areas 


of the economy are covered by such surveys than ever before and they make it 
possible for us to prepare estimates on an up-to-date basis.?° 


Also, it is possible that the improved methods to which Jaszi refers may correct 
the consistent biases which were discovered in the provisional estimates of 
personal consumption expenditures, personal consumption expenditures on 
services and nondurables, and new construction expenditures for the period 
1947 II-1955 IV. 

While errors in showing direction of quarterly change occurred frequently 
for some items (see Table 62), 8 of the 16 items considered showed 5 or fewer 
errors in 34 quarterly changes. This good showing, however, must be tempered 
by the fact that a sizable proportion of the errors in direction of change oc- 
curred in the neighborhood of cyclical turning points. For example, whereas 
provisional estimates of GNP disagreed with revised estimates on direction of 
change in only 5 cases, these occurred at the lower turning points of the 1948— 
1949 and 1953-1954 recessions. 

Finally, it must be remarked that the “seriousness” of the errors disclosed 
above can be appraised only with reference to the particular uses made of the 
provisional estimates. It is hoped that the analysis contained herein will provide 
a basis for the judicious use of provisional estimates and be of value to those 
engaged in the task of improving current estimates. 





» G. Jaszi, “The Statistical Foundations of the Gross National Product,” Review of E. ics and Statistica, 
38 (May, 1956), 205-14, 208. 








USE OF VARYING SEASONAL WEIGHTS IN PRICE 
INDEX CONSTRUCTION* 


Doris P. Roraweuu 
Bureau of Labor Statistics 


Seasonal variation in consumption presents practical and theoretical 
problems when index numbers of price change are placed in series. Tra- 
ditional formulas using varying seasonal weights for measurement from 
one month to the next will not yield satisfactory measures from year 
to year. This article describes a new type formula based on varying 
seasonal weights for month-to-month measurements which does not 
exhibit the “biases” of chain indexes and which satisfies classical index 
theory with respect to year-to-year comparisons. Results of statistical 
experimentation with alternative formulas are presented. 


EASONAL variation both in prices and quantity weights presents challenging 
a problems in price index construction for which traditional index theory is 
inadequate. Chief among them is the dilemma which arises because index num- 
bers are placed in series and because it is not possible in a single index to achieve 
accurate measurement of change both from month to month and year to year. 
Writers of textbooks usually confine their discussions to mathematical formulas 
and to single comparisons, chiefly annual. Index makers for the most part have 
devised practical working solutions without solving the basic issues. 

Clearly the seasonal problem has two aspects for price indexes: 


1. Seasonal price variation, 
2. Seasonal variation in consumption. 


For some purposes the desired solution may be elimination of seasonal price 
variation. Techniques have been devised for this and will not be discussed in 
this paper. 

The problem discussed here is the proper reflection of seasonal price varia- 
tion, taking into consideration seasonal variation in consumption. Use of annual 
weights, as for most indexes, simply ignores the effect of seasonal variation in 
consumption. 

The classical index theory of Laspeyres, Paasche, Edgeworth, Walsh! and 
somewhat more recently, Irving Fisher and Wald, rests on a framework of 
constant market baskets between consecutive periods. Yet it is the essence of 
the seasonal problem that market baskets are not constant from period to 
period. 





* The index method described in this article was developed by the author from earlier formulas, during work 
conducted over a period of 5 to 6 years beginning in 1949, at the Bureau of Labor Statistics. During that period 
considerable attention was paid to the possibility of using monthly seasonal weights in the revised Consumer Price 
Index issued in 1953, and extensive experimentation was conducted with alternative formulas. This is the work 
referred to by Bruce D. Mudgett in “The Measurement of Seasonal Movements in Price and Quantity Indexes” 
in the Journal of the American Statistical Association, 50 (1955), 93-8. For a number of reasons the Bureau abandoned 
the idea of using seasonal weights at that time. The results of the Bureau's investigations, however, may be of value 
to others. Although I benefited greatly by di i and suggestions of staff members of the Bureau, especially 
Abner Hurwitz, Chief of the Cost of Living Branch, this paper represents my own views. 

1 Walsh, Correa Moylan, The Measurement of General Exchange Value, 1901. 
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More recent writers on index numbers, such as Ragnar Frisch, Hans Staehle, 
and Francois Divisia, are generally more concerned with the economic theory 
of indexes, and more particularly cost of living indexes, and with the develop- 
ment of theoretically ideal indexes, than with the practical problems confront- 
ing index makers. Although these writers are concerned with the problem of 
changing market baskets, they do not deal specifically with seasonal variation 
within a calendar year, for which their solutions seem much less satisfactory. 

There are two generally accepted tenets of price index theory: 

1. A price index between any two given periods must be a function of changes 

in price of identical items; 
The best measure of price change between two given periods uses weights 
appropriate to one or both periods. 


The first principle—on the definition of an index—has been considered funda- 
mental—so much so that the literature frequently defines a price index as a 
weighted average of price relatives. This means, among other things, that a 
price index cannot change unless prices of individual items change. 

The second principle—on the nature of the weights—has been considered 
secondary—necessarily sacrificed to the first whenever price comparisons are 
placed in sequence in an index series. We find that traditional formulas for 
price indexes are all variations of the same basic formula: 


p> Pida 
De Pide 
p;=price in current period 


pi=price in earlier period 
ga = quantity weight. 


in which 


The summation here as elsewhere in the paper is over commodities included in 
the index. The various formulas define quantity weights differently but they 
are all alike in that quantity weights for a given comparison period are constant 
in numerator and denominator. This is true even of Divisia’s integral formula,’ 
the main point of this being that the difference in the time periods compared 
must be infinitely small. 


DILEMMA OF SEASONAL VARIATION 


Up to now, any proposed use of changing seasonal weights has foundered on 
the first basic principle, which is unworkable for both monthly and annual 
measurements. This is the so-called circular test which states that the product 
of season to season relatives based on varying weights for each comparison 
must yield an index which is a weighted average of price relatives over the year. 
Assuming 4 seasons, 1, 2, 3, and 4: 


I; 
Ferst di 








? In his formula the ratio of values is expressed as the product of price and quantity indexes which are solved 
for by differentiation. See Divisia, F., “L’Indice Monétaire et la Theoria de la Monnaie,” Revue d'Economie 
Politique, 39 (1925), 99-1004. 
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The test is whether this also equals J,; where J; is a weighted average of price 
relatives from period 1 to period 5, i.e., between the same seasons of successive 
years. Although this has been discredited by text book writers as a statistical 
test, practical index makers find it difficult to dismiss when year to year com- 
parisons give unrealistic results. As V. Hofsten said* “it is unnatural to accept 
an index which may after a year give a result different from 1, if the prices 
have returned to their initial values.” If we take as axiomatic that a valid price 
index is a function of changes in price only, i.e., that the total index can be 
expressed as some sort of weighted average of price relatives of individual 
items, then “correct” measurements of price change between seasons and 
between years are incompatible in a single index. Expressed algebraically there 
is no set of changing weights which will satisfy the equation: 


DP? Ds Dd aes Des «=D Gas 
x x x - ; 
Dar = ge Ps es |S a 





In other words there is no set of changing seasonal weights within a year such 
that the product of the season-to-season relatives during the year is also an 
index which is a weighted average of price relatives over the year. 

The second principle underlies most arguments in favor of chain indexes and 
against fixed base indexes. It is because of this that “chaining” together 
monthly links calculated with varying seasonal weights has been proposed as 
a better measure of monthly price changes. Unfortunately there is no standard 


formula which uses seasonal weights to measure price change from one month 
to the next which will give the most accurate measurement from one month to 
the same month of the next year. On the principle of appropriate weights, the 
change from one given month to the same month of successive years would 
use weights of the given month. As already indicated, this can not be the 
change which would result from multiplication of successive relatives of 
monthly price change. 

In brief, the two basic tenets are incompatible unless weights remain the 
same, i.e., unless market baskets are constant. 


PRESENT SOLUTIONS 


The practical builders of price and cost-of-living indexes throughout the 
world have been forced to find practical solutions for seasonal variation in con- 
sumption and in availability of prices. The problem is real—namely the fallacy 
of constant annual weights for combining price changes at different seasons. 

Some countries, notably Sweden, Denmark and the former British Mandate 
of Palestine, at various times have adopted procedures designed to eliminate 
the effect of seasonal price variation from their indexes. A number of countries, 
e.g., Britain, have used chain indexes with varying internal seasonal weights 
to measure average price change for certain groups of items such as fresh 
fruits and vegetables, but have combined the groups with constant annual 
weights. Some countries have tried pairing items available in different seasons, 
moving a total weight by the price change of the items in season. In the 1930’s 
Latvia used a straight expenditure ratio for food rather than a price index. 





* Erland V. Hofsten, Price Indexes and Quality Changes, Stockholm, 1952. 
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Other countries have compared prices and weights for a given month with 
prices and weights for the same month of the base year without much considera- 
tion of the meaning of the resulting month-to-month movement. 

In the United States up to now, both wholesale and retail price indexes of the 
Bureau of Labor Statistics have employed constant annual weights. Lack of 
prices for given items in given seasons is handled by estimation—either by 
holding constant until prices again become available, or by assuming the same 
price change for out-of-season items as for year-round items. Such matters are 
usually considered outside of the index formulation. 

Canada uses the U. 8. method for clothing, but has adopted a special formula 
for highly seasonal foods, fruits and vegetables, fats, eggs, and meats: 


__ dpa 
(1/12) >> Bodo 





Iu 


where 


q; = Weight appropriate for month j 
P.qo=sum of monthly expenditures in base year. 


This solution meets some but by no means all of the difficulties posed by 
seasonal variation. It does recognize variations in monthly market baskets. 
At the same time it avoids the inherent drifts of chain indexes which arise for 
long time, and particularly, year to year measurement when monthly links 
are placed in sequence. However, the index is actually a ratio of expenditures, 
in which the numerator is the product of monthly quantities and monthly 
prices and the denominator is 1/12 the annual value weight (or 1/12 the sum 
of monthly expenditures) in the base year. Therefore, some of the month to 
month fluctuation is due to differences in physical quantities. 


PROPOSAL 


The author’s formula is suggested as one possible solution to the dilemma of 
month-to-month change in an index series. As with all such proposals, the 
method has certain distinct advantages as well as certain difficulties. It repre- 
sents an adaption of earlier proposals which relies partly on classical index 
theory but avoids the pitfalls of chain indexes for month-to-month measure- 
ment. A number of systematic computations based on alternative formulas 
with realistic price data under various controlled conditions were made, which 
point up some startling consequences too easily passed over in theoretical 
analysis—particularly the inherent “biases” of chain indexes as compared to 
fixed base indexes for long time measurements. These are most apparent when 
both prices and quantities revert to the identical values of an earlier period—a 
not unrealistic condition in comparisons of one year with the next. 

First it seems reasonable to define the “correct” average price change be- 
tween the same month of successive years (when market baskets are identical) 
as a weighted average of price relatives over the year for all items included in 
the month’s basket using seasonal weights for the month. The definition of 
the “correct” price change between consecutive months (when items and 
weights of the market basket are different) is not obvious. 
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The proposed formula accepts tradition for the year-to-year measurement 
and compromises with tradition on the month-to-month measurement where 
no a priori reasoning as to the best measure is apparent. 

For convenience of exposition the formula will be presented in three forms 
which are algebra:cally equivalent: 

(A) comparison between comparable months of successive years, (B) 
index on base period, and (C) monthly links. It will be discussed in a frame- 
work of constant annual weights, although the formula does not preclude 
periodic weight revision. 


SYMBOLS 


T;, Tia1, ete. =index for current period 
P;, Pi, ete.=price for current period, corresponding to month s, s+1, of 
base period 
P., Py, ete. = price for given month of base period 
p.= weighted annual average price in base period 
Qs; Gert, ete. =seasonal weight appropriate to the months 7, 7+1, etc. 
go=annual base quantity weight, or the sum of the monthly 
seasonal weights 
pode = annual base period value weights. 


In normal use for consumer price or cost-of-living indexes, the values p,q, 
etc. represent actual monthly family expenditures. The value q, is the numerical 


sum of q., de41, ete. The value, is the annual average price, obtained by dividing 
the sum of monthly expenditures by q, or weighting the monthly prices by the 
seasonal weights. The value j.q, is the sum of monthly expenditures. 


(A) Relative between Comparable Months of Successive Years 
Form (A): 

y — pas 

hte t—12Y¥8 

I; Ld Pid Di-12 





Tine 2: Pi-12de D> Pi-12s 
This is the traditional form, in which weights g, (seasonal quantity weights 
for the particular month) are constant in numerator and denominator. Viewed 
independently and leaving out of consideration the basis for the seasonal q’s, 
this appears to be the most accurate measure possible of average price change 
between the same months of successive years. 


(B) Index on Base Period 


During the base year, 2 sets of monthly values totalling to annual base period 
value weights can be constructed: 


1) > PQs + ¥; PeiQeti * °° = 2 oo 
2) De Bode + De Poder: + - = DL Bago. 


The values of j.q., etc. are calculated values, representing the product of the 
annual average price (j,) and seasonal (monthly) quantity weights. 
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The formulation of the index on the base period is simply: 
Form (B): 


} Pids b P i419s+1 
a or | ere S = 
Dd Bod: Dd bodes 


In this form the index for any month is a weighted average of price relatives 
between prices for a given month and the base year annual average price, using 
seasonal weights appropriate to the given month. Each of the hypothetical 
values >-j.q., etc. will become a base for reference and each month of the year 
will have a different base period aggregate—the differences due solely to differ- 
ences in quantities, since the same prices are used throughout. Indexes for each 
month of the base period will not equal 100 but will be calculated by the above 
formula in the same way as in subsequent years and defined in the same way. 
They will reflect the average ratio between the monthly prices of the base year 
and the annual average prices of the base year using monthly weights. 
Ratios between the base period calculated values, e.g., 


>» Poo >. PoGe+t 
—_— or —_—— 
pi Pods y i Pods 
are a measure of the difference in annual average prices for a group resulting 


from different quantity weights. They also are a quantity index of seasonal con- 
sumption. The formulation of indexes for later periods on the base period can 


qT; etc. 


therefore be expressed as an expenditure ratio adjusted for the differences in 


quantity: 


_ Lepde — Le Pie Di Botte 
De Bote = DL Bolle Le Bolts 


This expression is similar to that for the month-to-month change discussed in 
the following section. 





T; 


(C) Month-to-Month Price Change 


There is no obviously “correct” measure of price change between consecutive 
months (when items and weights of the market basket are different). Concep- 
tually, the problem is identical to that of measuring differences in price level 
between two countries having radically different market baskets. 

The formula proposed provides an indirect technique for a price index with 
respect to month-to-month changes, within the framework of direct year-to- 
year measurement already described. This indirect technique has been used 
for other statistical series when the desired statistic cannot be measured directly. 
For example an index of physical volume can be derived by deflating (dividing) 
a value index by a price index. The BLS productivity index is obtained by 
dividing an index of physical volume of production by an index of man-hours 
worked. Why then can we not derive a price index by dividing a value index 
by a quantity index? This basic idea is inherent in the formula proposed by the 
German mathematician, M. W. Drobisch,‘in 1871 for measurement of changes 
in exchange values: 





«See Walsh, op. cit., pp. 98, 182, 383, 545. 
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This however contains the unrealistic condition that the quantities must be 
expressed in the same units so as to be additive. 

A ratio between the total values of the two different market baskets is a 
function of changes in price and quantity. If the effect of differences in quan- 
tities (product-mix) of two seasonal market baskets could be eliminated from 
such an expenditure ratio, the residue should represent a price index. 

A seasonal index of quantities can be calculated by usual index methodology 
—applied to quantities rather than prices: 


:> PaQe+i 
Lo Pads 
in which p, =constant price. The ratio of the values 


} PoGe+ 
Dd Ba 


(Form B) for the base period, in which the base period annual price is used as 
the constant price, is such a seasonal index. The product of these ratios over 
the year is unity. 

In the same way that an ordinary price index is a ratio between two expendi- 
ture values in which quantities are constant in numerator and denominator, 
this quantity index is a ratio between two expenditure values in which prices 
are constant. A straight ratio of aggregates in which both prices and quantities 
vary, divided by this seasonal quantity index, then will show the effect of price 
changes only. 

The formulation of the month-to-month change is then: 





(Expenditure Ratio) + (Quantity Index) 
Form (C) 


De pisigesr DY Boers 





pt LD bole 


In this form, the price index is an expenditure ratio divided by the quantity 
index calculated for the base year, or adjusted for the difference in quantities 
in the two periods. (It should be pointed out that the formula does not preclude 
periodic revisions of either the base prices used as weights in the quantity 
index or the quantities used as price weights.) This expression is equivalent 
to the ratio of indexes for two consecutive months calculated in Form B. 

It can also be shown by cancellation of terms that successive multiplication 
of monthly relatives in this form over the year will equal the simple monthly 
expression of year-to-year relatives (Form A) already given. 


| > PitiQe+i : > Pods | | ts Pi+2Qs+2 > PoGe+1 | saat Be Pi+129s 
DL pide = De Bogerr SLD pisrgerr DE Boddos Dd Pigs 
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In other words, the quantity index becomes unity when comparing comparable 
months of successive years. Form (C) thus becomes simple Form (A) from year 
to year or Form (B) on the base period. 


ANNUAL INDEXES 


One final word concerning the relation of monthly to annual indexes seems 
pertinent. Annual indexes, independent of monthly indexes, can be constructed 
from annual average prices, weighted or unweighted, or they may be calculated 
as averages of monthly indexes, weighted or unweighted. The former is the usual 
textbook treatment; the latter the general practice of index makers. Since most 
indexes use fixed weights, the two methods usually yield identical averages. 
Under most formulas, using varying weights, the two types of annual indexes 
give different results. Under the proposed formula, given proper monthly 
weights, the two types will yield the same answer. 

Even though problems of combining price changes of individual items with 
changing weights have thus far seemed insoluble, no such difficulty exists 
for combining monthly (or weekly) prices of an individual item with seasonal 
weights to obtain a properly weighted annual average price comparable with 
the base period. It seems certain that this is the best possible annual average 
price. This type of average is implicit in base year weights. It seems equally 
reasonable, if only annual indexes were needed, that the correct annual index 
is a weighted average of relatives between weighted annual average prices, 
combined with annual consumption weights. 


An advantage of the formula for monthly indexes discussed here is that such 
an annual index would in fact be a weighted average of the monthly indexes 
and could be calculated in this way, rather than by an independent calcula- 
tion. This can be shown algebraically. Assuming for simplicity only 4 seasons, 
in which 9.+4s41+4s+2+4s+3 =o, the annual average index would be: 


, § QsPi + > Qs+1Pi+1 + pe Qs+2Pi+2 + } 2m Qs+3P i+3 : 
Do Gobo 


But monthly indexes by the proposed formula are: 


> QsPi po Qe+i1Pi+1 > Qs+2Pi+e b » Qs+3Pi+3 ; 
Lah Le Gete.. Lo Gerke, Le Merahe 
Weighting these by base year values, 
a. qsPo Pa Qe+i Po y Qe+2Po ) Qs+sPo 
pe de> > iy alee 3 ee ert 














we find, 
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This is the independent average desired. 
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STATISTICAL EXPERIMENTATION 


In support of the proposed index formula, considerable statistical experi- 
mentation was performed at the Bureau of Labor Statistics using a number of 
different formulas, both with actual and “controlled” retail prices. The details 
of the calculations as well as statistical tables are included in a statistical ap- 
pendix available on request to the author. They were designed essentially to 
demonstrate the “drift” of chain indexes and the fact that most indexes which 
utilize varying weights do not meet the criterion of “correct” year-to-year 
price measurement described in this article. The results conclusively demon- 
strate the unsuitability of certain of these formulas for regular index calcula- 
tions. 

If we assume no change in annual consumption and a set of retail prices for 
4 years varying seasonally only, with the prices returning each year to the level 
of the corresponding month of the preceding year, we would expect indexes for 
a given month to be alike in all years. This is the case for any formula which 
maintains constant weights for all calculation periods, as well as for the for- 
mula proposed here. It is not the case for any chain index. Taking July as an 
example, the monthly indexes for fresh fruits and vegetables in four successive 
years when prices are identical were as follows by three well known chain, as 
well as the proposed, index formulas, using varying weights: 

(Indexes Year 1 =100) 


Proposed Marshall- 








Formula Laspeyres Paasche Edgeworth 
| | 
July, Year 1 98 .2 105.0 92.7 100.2 
a 98.2 111.3 79.5 | 95.7 
3 | 98.2 117.8 68.1 | 91.3 
4 | 98 .2 124.6 58.4 87.4 





Such definite drifts in chain indexes are clearly beyond logical acceptance; 
but the proposed formula satisfies this test of logic. 

If we then assume 1 per cent per month price increase for every item in the 
group, we would expect the group change to be 1 per cent per month, (1.01” 
— 1.00) or 12.67 per cent per year, and (1.01%— 1.00) or 27.0 per cent in 2 years. 
Following are the results of this calculation for the above formulas, using 
January as an example: 


(Indexes Jan., Year 1 =100) 





Proposed | Marshall- 





Formula Laspeyres Paasche Edgeworth 
| — — ciao 
Jan., Year 1 100.0 100.0 | 100.0 | 100.0 
2 112.67 120.0 97.1 108.3 





Certainly we cannot accept a decreasing index, as for Paasche, in the face 
of higher prices for every item, nor would we rely too much on an index which 
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rose 20.0 per cent in a period when every item rose 12.67 per cent. Again the 
proposed formula meets this test of logic, whereas ordinary chain indexes do 
not. 

Two charts are presented here for illustration of the experimental calcula- 
tion. Chart 77A shows an index of retail prices for fresh fruits and vegetables 
showing seasonal movement only, by two standard formulas with constant 
annual weights (Laspeyres), as compared with the movement using seasonal 
weights by the formula discussed in this paper. The two standard formulas 
using annual weights differ only in the treatment of items not in season. In the 
one case they are held constant, in the other assumed to move with the group. 
For these calculations, this difference in treatment had relatively little effect. 
Although index fluctuations under the suggested formula were somewhat larger, 
the general pattern of seasonal movement is basically the same. The essential 
point to be gleaned from this chart is that the identical movement is repeated 
in subsequent years. 

Chart 77B, showing 3 chain indexes based on seasonal (monthly) weights but 
identical prices, is in striking contrast to Chart 77A. There is a wide divergence 
in index movements by the three formulas and more importantly, the indexes 
do not return to the same level in succeeding years when prices are identical. 
The Paasche chain has a very strong downward drift and the downward bias, 
as is customary for such indexes, increases with the number of links. At the end 
of 4 vears (48 links) the Paasche index is down to something over 50 as com- 
pared to 85 at the end of the first year. The Laspeyres chain has an upward 
bias and the Marshall-Edgeworth, as expected, is in between. What makes 
these indexes inacceptable on pure logic is a decrease or an increase in the 
average change over the year when prices of every item are unchanged (by 
design), and weights are the same (also by design). 

Clothing indexes show similar “drifts” but since the month-to-month changes 
are much smaller than for fruits and vegetables, the biases are less apparent. 
Formulas based on fixed annual weights and the formula under discussion show 
no such drift. 

CONCLUSIONS 

It seems desirable to summarize briefly the results of the investigations de- 
scribed. It was pointed out that traditional index theory assumes constant 
market baskets but that market baskets are not constant from month to month 
with respect to seasonal items. Moreover accurate measurement of price change 
from month to month and year to year are incompatible in an index series by 
ordinary formulas. 

Index makers have devised working solutions of the seasonal problem without 
coming to grips with the statistical dilemma. For the most part the solutions 
ignore or circumvent the problem. 

To students of index numbers, the chain index is the statistical solution to 
changing weights. As a matter of fact, even so-called fixed base annual weight 
indexes are really chain indexes, since any index number series which main- 
tained constant weights for too long a period would be so unrealistic as to be 
inacceptable. Yet chain indexes have inherent biases, demonstrated in an exag- 
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gerated way through the experimental calculations with seasonal weights per- 
formed by the Bureau. These results should make index makers hesitate to 
accept chain indexes too readily to solve the problems of seasonal consumption 
or annual weight revisions. 

The writer offers an index formula which combines direct measurement of 
price change from the base period and from year to year, using constant weights 
by traditional methods, with indirect measurement of month-to-month change 
employing varying seasonal weights. It has the following points in its favor: 

1. It recognizes seasonal variation by using weights appropriate to both months in 
any price comparison. 

2. It gives a measure of price change between the same month of successive years 
which meets statistical and logical tests. 

3. It does not depend (as do ordinary chain indexes) upon a constant list of commodi- 
ties in successive pricing periods. Thus it makes possible proper reflection of price 
change from the end of one season to the beginning of the next. 

4. Unlike certain other seasonal formulas, it can be calculated with reference to a 
year or average of several years as a base. 

5. It makes possible an annual average of monthly indexes which is identical to a 
properly and independently calculated annual index. 


The important objection which has been raised and the main reason why the 
formula was not accepted for the revised Consumer Price Index is that the 
month-to-month change cannot be explained as a simple price change between 
constant market baskets. It is the writer’s opinion that the criticism is not valid 
because family expenditures in reality are not constant from month to month. 
Primarily it is her contention that the theory of price index construction de- 
serves searching study and modernization with respect to changing weight 
patterns. 
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PHILIPPINE STATISTICAL PROGRAM DEVELOPMENT AND 
THE SURVEY OF HOUSEHOLDS 


Miuton D, LizBERMAN 
U. S. Bureau of the Census 


The Philippines has embarked on a comprehensive program of sta- 
tistical development. The main survey device employed is the Philip- 
pine Statistical Survey of Households which is producing vitally needed 
data for planning and administration. Operating and conceptual prob- 
lems experienced may be similar to those in existence in other coun- 
tries of like economic and social characteristics. 


INTRODUCTION 


N accordance with an agreement between the National Economic Council 
] representing the government of the Republic of the Philippines and the 
International Cooperation Administration of the United States a compre- 
hensive statistical project was commenced late in 1955. The project recognized 
the long felt need for good current basic demographic and economic data. The 
last national census of the Philippines, and incidentally the first as an inde- 
pendent Republic, had been taken in 1948. Besides being out of date there 
were numerous charges leveled that the published statistics have been in- 
ordinately influenced by the unrest prevailing in the country at that time, by 
political conditions extant, and by the lack of adequate preparations and 
training. 

The present U. S. technical assistance mission to the Philippines started 
operations in 1951. yo immediately the lack of good statistical data was 
apparent. Only piecemeal efforts were made to collect and present the informa- 
tion needed for policy planning, technical programming, and day to day opera- 
tions. The two most obvious handicaps were lack of funds appropriated by the > 
Philippine government for statistics and the extreme scarcity of statisticians 
and other trained personnel. Starting as early as 1953 the U. S. operations 
mission and the counterpart Philippine officials commenced planning the 
statistical project. But it was not until 1955 that the project was accepted and 
inaugurated. 

A special agreement was entered into between the ICA and the U. 8S. Bureau 
of the Census whereby the Bureau undertook to provide a team of statistic ians 
who were to supply technical guidance and share operational responsibility with 
the designated Philippine government counterparts. 

The terms of reference of the project were stated in the broadest terms. The 
Statistical Survey on Population, Manpower and Economic Activities is de- 
signed (1) to produce up-to-date statistical data on the size, composition, and 
distribution of the population, manpower supply and demand and economic 
activities of the country and establish reporting systems to maintain such data 
on a current basis, and (2) to improve the over-all statistical operations of the 
Philippine government. 








PHILIPPINE STATISTICAL PROGRAM 79 


It was recognized that the problems of statistical operations existed in 
virtually every field. Hence a number of Philippine agencies became principals 
in the implementation of the programmed portions of the statistical project. 
These are the National Economic Council as the coordinating agency; the 
Bureau of the Census and Statistics as the primary operating agency; and the 
Department of Agriculture and Natural Resources, the Department of Labor, 
the Bureau of Commerce of the Department of Commerce and Industry, and 
the Economic Research Department of the Central Bank of the Philippines as 
the main cooperating agencies. Technical assistance and financing arrange- 
ments are channeled through a special project unit set up under the Office of 
Statistical Coordination and Standards of the National Economic Council. This 
unit carries on the functions of planning, organization, coordination, super- 
vision and execution of the various sub-projects. A project director appointed 
by the President of the Philippines is responsible to the Chairman of the 
National Economic Council and is the counterpart of the Chief of the U. S. 
Statistical team. 

This paper is not planned to cover the entire statistical project nor the en- 
tirety of the work of the U. 8. Statistical team. Instead it is limited to a 
description of the Philippine Statistical Survey of Households which was or- 
ganized and set into operation as the keystone of the statistical project. A 
mention of the remaining statistical activities being carried on as part of the 
project will suffice to indicate the breadth of the program. 

1. Survey of Manufactures—This has been established as an annual survey 
with quarterly supplements and is aimed at securing data on employment, 
earnings, production, materials consumed, value added, financing, capital for- 
mation and other data. 

2. National Income Estimates—Starting with 1957 the national income 
estimates are being prepared and presented by the Office of Statistical Coor- 
dination and Standards of the National Economic Council. Previously these 
estimates were produced by the Central Bank of the Philippines. 

3. Improvement of Agricultural Statistics Project—This consists of a number 
of sub-projects as follows: 

(a) Updating of Agricultural Statistics—Tabulation and publication of col- 
lected but unprocessed crop and livestock data. 

(b) Prices Received by Farmers—This is a survey of the prices received by 
farmers for their products sold at local markets or at points where farmers 
deliver their products to buyers. 

(c) Prices Paid by Farmers—During the fiscal year ending June 30, 1958 a 
study on prices paid by farmers will commence. 

(d) Costs of Production—This is a nation-wide sample survey on the costs 
of production of a selected number of crops; namely, rice, corn, abaca, tobacco, 
cattle and hogs. 

(e) Capital Formation in Agriculture—This is a nationwide sample survey 
which collected data on value of capital formed in the agricultural sector. 

(f) Rice Milling Reeovery—This is a small sized sample study of rice mills 
which attempted to measure the yield of rice from the harvested palay (paddy). 
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4. Vital Statistics Project—This project eliminated a sizeable backlog of 
unprocessed birth and death certificates. 

5. Foreign Trade Statistics Project—This project provided funds for the 
processing of a large backlog of unprocessed import and export documents. 

6. Nutrition and Dietary Survey—A sample survey was taken in selected 
households to study nutrition and dietary deficiencies of the population. A 
follow up in sampled households by a biological and medical team was also 
included in the program. 

7. Mapping and Census Preparation—Basic maps and other essential source 
material required are being assembled in preparation for the next national 
census of the Philippines. 

Other statistical activities of the Philippine government including extension 
of a retail and wholesale price collection system, trade statistics collection and 
compilation, financial statistics, etc., are not mentioned as part of the Statis- 
tical Project inasmuch as they are financed separately. Nevertheless they are 
considered as a part of the entire coordinated system of government statistics 
which is presently being developed. 


THE PHILIPPINE STATISTICAL SURVEY OF HOUSEHOLDS 


The Philippine Statistical Survey of Households was pianned, developed, 
and installed as the chief mechanism for securing vitally needed data on the 
size, composition, and distribution of the population, labor force, economic 
activities, and other related information on the inhabitants of the Philippines. 
Modelled in some respects on the Current Population Survey of the U. 8. Cen- 
sus Bureau! the household survey collects data on the labor force, employment, 
and unemployment characteristics of the population as well as other data re- 
lating to the household. While not embodying any particularly new principles 
of sampling, the Survey is an entity which is expected to supply an increasing 
flow of economic and demographic data for government and private consump- 
tion. The sample design used is a three-stage one and has produced about 6,000 
enumerated households from 145 municipalities, 299 barrios, 145 poblaciones, 
and 58 provincial capitals and cities. 

The formation of 10 regional divisions of the Philippines was made in advance 
of the other selections; these divisions considered ethnic origin, language 
spoken, economic activities, climate, major crops grown, and other factors. The 
sampling plan, stratification, and selection procedures are illustrated separately 
for the rural and urban areas. 

While the sampling plan does not present innovations, it is significant to 





1 Current Population Reports, Series P-57, U. 8. Department of Commerce, Bureau of the Census. 

* Philippine Statistical Survey of Households Bulletin, May, 1956, Series No. 1, Vol. 1. >> 

> This selection was based on the Tukey plan (W. E. Deming: Some Theory of Sampling—pp. 96, 353) and used 
by Lahiri in the National Sample Survey of India. In each stratum the sample units were selected and their order 
of selection noted. The units which had number 1 order of selection in each stratum formed a subsample of the total 
sample. Similarly the units with number 2 order of selection formed a subsample as did numbers 3, 4, and 5. Each 
of these subsamples provided independent estimates of the characteristics being investigated. This method of 
selection readily permits estimates of the national total of characteristics by obtaining the average of the 5 estimates 
from the independent subsamples and the variance of the national total estimates by considering the deviations of 
each subsample estimate from the average. 
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SAMPLING PROCEDURE FOR RURAL AREAS* 








Type of Sampling 


Stratification 


Selection of 
municipalities 


Selection of 


poblaciones and 


households 





1. Three stage sampling: 


a. First stage: 
Municipalities 


b. Second stage: 
Barrios and 
poblaciones 


c. Third stage: 
Households 


2. Number of house- 
holds selected: 


a. Barrios: 3,000 
b. Poblaciones: 1,500 





. Municipalities 


were arranged 
into 10 regions 


2. Municipalities 


were stratified 
according to 
their densities 
of population in 
ascending order. 


. Strata were 


formed by count- 
ing of their ap- 
proximate spec- 
ified popula- 
tion starting 
from the mu- 
nicipality with 
the lowest den- 
sity. 


. The strata were 


adjusted so that 
their population 
are approxi- 
mately equal 
and in no case 
was a munici- 
pality divided. 


. 30 strata were 


formed. 





. Five municipal- 


ities were ran- 
domly selected 
with probabil- 
ity proportional 
to size of popu- 
lation and with 
replacement in 
each stratum. 


. Selection of the 


subsamples was 
made.» 


. 145 municipali- 


ties with 1 repe- 
tition each in 5 
cases were se- 
lected from all 
the 30 strata. 


. Sample 





1. Two barrios and 


the poblacion of 
selected munici- 
pality were se- 
lected for com- 
plete listing of 
households. 


. 145 sample po- 


blacionesand 299 
sample barrios 
were selected 
from 145 sample 
municipalities. 


. Sample barrios 


were selected 
with equal prob- 
ability and with- 
out replacement. 


. Poblaciones of 


selected munici- 
palities were au- 
tomatically made 
sample _ pobla- 
ciones. 


. All households in 
each selected bar- 
rio or poblacion 
were completely 
listed. 


. The number of 


sample _house- 
holds to be enu- 
merated was de- 
termined by the 
product of aspec- 
ified proportion 
and the total 
number of house- 
holds in the se- 
lected barrios or 
poblaciones. 


house- 
holds were se- 
lected systemat- 
ically. 
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point out that methodical controlled operations inherent in any successful 
sampling scheme have been adhered to in the survey. In the Philippines the 
difficulties of inaugurating a widespread sample survey were compounded by 
geographical problems, poor transportation facilities, some law and order prob- 
lems, and numerous dialects in use. Further, the need to create a new survey 
staff from inadequately prepared and trained personnel was an initia] task beset 
by many difficulties. A small nucleus staff consisting of two persons who had 
received training at the U. S. Bureau of the Census and several others who had 
previous experience at the Philippine Bureau of Census and Statistics was re- 
cruited. A budget was drawn up and preparatory operations commenced late 
in 1955. 

Ten regional supervisors were recruited from a large list of applicants. Re- 
cruitment was based on written examination, personal interview, previous 
work experience, education, and location of the applicant’s home province. 
Insofar as possible it was desired that regional supervisors come from the area 
of supervision but travel and per diem costs were kept down by confining re- 
cruitment to Manila. An initial extensive training period was carried out which 
included frequent field tests of the questionnaire. Within their designated areas 
of supervision the regional supervisors then carried out all the administrative, 
technical, and supervisory tasks required in setting up the survey. These tasks 
included the establishment of good working relations with provincial governors, 
municipal mayors and other officials; recruiting and training interviewers; 
supervising the progress of the work: issuing press releases; serving as pay- 
masters; and maintaining close liaison with the central office in Manila. During 
each of the survey cycles the central office provides certain professional em- 
ployees to conduct specified lectures and aid in the instruction of the more dif- 
ficult concepts involved in the enumeration. Initially some personnel were bor- 
rowed in some instances from other government agencies which were concerned 
in certain aspects of the survey. 

The first survey was taken in May, 1956, and three more surveys were taken 
in October, 1956 and March and May, 1957. In fiscal year 1958 only two sur- 
veys are presently planned, namely October, 1957 and May, 1958. The first 
survey had originally been planned to include basic demographic items like 
sex, age, marital status, education level, and fertility data for women ever 
married, plus a detailed list of questions relating to the labor force. However, 
the paucity of available data on income, expenditure levels, housing and cottage 
industry caused queries on these subjects to be included. 

Data on income and expenditures are considered study items and were not 
planned for publication. Based on this study a comprehensive income and 
expenditure study was conducted in March, 1957, along with the demographic 
and labor force items. The October survey was limited to the labor force series 
of questions. The May, 1957 survey included the labor force complex of ques- 
tions plus some other items carried the previous year which accordingly per- 
mitted a study of year to year changes. 
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SAMPLING PROCEDURE FOR URBAN AREAS* 
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Type of Sampling Stratification 


Selection of 


Selection of 











Precincts Households 
1. Two stage sampling: | 1. The chartered | 1. Five precincts | 1. All the house- 
|  ¢ities and pro- were selected at holds in selected 
a. First stage: vincial capitals random from cities and pro- 
precincts were grouped each = stratum vincial capitals 
| into ten regions with equal prob- were listed. 
b. Second stage: | after the three ability and with 
households | big cities, name- replacement. 2. The number of 
ly Cebu, Iloilo, sample house- 
2. Number of house- and Davao were . 149 precincts, holds to be enu- 
holds selected separated. with 1 repeti- merated was de- 
tion in 1 case | termined by the 
a. Chartered cities | 2. These werestrat- were selected | product of speci- 
and provincial | ified according from all char- fied proportion 
capitals—1,200 to degree of ur- | tered cities and | and the _ total 
banization; i.e. | provincial capi- | number of house- 
b. Metropolitan percentage of | tals; 159 pre- holds in selected 
Manila—8s00 urban precincts cincts, with 1 | precincts. 
in the descend- | repetition in 1 
ing order. case, in Manila. | 3. Systematic _ se- 
lection of house- 
| 3. Strata were holds to be enu- 


| formed by count- 
ing off their ap- 
proximate spec- 
ified number of 
precincts, start- 
ing from the 
| highly urbanized 
city or provin- 
cial capital. 


| 4. The strata were 
adjusted so that 
the number of 

precincts in each 

strata is approx- 
imately equal 
| and in no case 
was a city or 
provincial capi- 
tal divided. 


5. 30 strata were 
formed in cities 
and provincial 
capitals includ- 
ing the three big 


cities; 32 in 
Metropolitan 





Manila. 














merated 
made. 


was 





*® Philippine Statistical Survey of Households Bulletin, May, 1956, Series No. 1, Vol. 1. 
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THE MAY, 1956 SURVEY 


By and large the timetable of operation was successfully followed. In areas 
like southern Mindanao, Palawan, and in Central Luzon where transportation 
facilities were especially difficult, the listing and enumeration operations took 
longer than expected. The total number of field staff was about 300 and in 
general one enumerator covered a single municipality consisting of the pobla- 
cion and two sample barrios. In the cities sample precincts were generally 
covered by separate enumerators. 

Questions on the schedule were divided by blocks as follows: identification; 
demographic data; economic activities for persons 10 years of age and older; 
household industry; housing data; agriculture data for farm households; in- 
come data; expenditures data (limited to a 20 per cent subsample). 

The questionnaire form was voluminous especially with respect to income 
and expenditure data and while there has been some criticism leveled at the 
length there was good evidence that there was no adverse effect on the quality 
of data. One real innovation in Asian survey taking was the adoption of a 
scientifically selected sample of households for a post enumeration survey. This 
post enumeration survey attempted an objective quality check limited to 
households which were taken in the original survey. The P.E.S. did not attempt 
to check household selection within sample areas nor did it check for possible 
erroneous coverage outside the sample areas. This area of possible error was 
covered by the regular supervisory and checking system carried out during the 
course of the survey. Differences between the original interview and the post 
enumeration interview were revealed and will be separately presented. The 
incidence of specified errors or differences in regional areas and in municipalities 
was studied and used in great advantage in reviewing the work of field per- 
sonnel and for planning in subsequent surveys. 

Schedule processing, including editing, coding, punching, and tabulating are 
carried out in sequential steps; I.B.M. machines have been utilized. The survey 
serves as an excellent laboratory and training device for census planning. 

Many of the concepts evolved from the initial survey were based on condi- 
tions existing in the Philippines and are not necessarily applicable in all coun- 
tries. Some countries of the ECAFE region, however, may study the Philippine 
survey since similar conditions may exist to a greater or lesser degree.? The 
complex of questions used to derive the labor force components are most 
illustrative of this point. First of all the questions on economic activities 
included all persons at least 10 years of age and older. This inclusion followed 
Philippine census practices and permits special analysis of the group aged 10 to 
13 years. The work status for a year for each individual was initially estab- 
lished; then the emphasis was put on a single week. The categorization into 
those working, doing housekeeping, attending school, and doing nothing in 
this week then followed. A check question was asked of those persons who 
reported themselves as either doing housework, attending school or doing 
nothing to validate their previous response. The remaining questions in the 





2 Recent information indicates that Viet Nam and Pakistan are inaugurating sample household surveys with 
U. 8. Census Bureau/International Cooperation Administration technical assistance. 
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block then attempted to determine if a person had a job or business but was not 
at work; reason for not working; wanting work; reason for not wanting work; 
previous work experience, if any; looking for work; reason for not looking, in- 
cluding belief that no work was available; interval since last regular employ- 
ment; and duration of work availability status. Data on occupation, industry, 
wages and salaries, information on additional jobs held, disability as a factor 
in not working and some indication of internal movement for employment 
reasons completed the interrogation relating to the economic activities of the 
members of the sample households. 

Household industry, housing data, and farm information for farm house- 
holds, together with the income and expenditure blocks completed the ques- 
tionnaire. 

The labor force classification and its conceptual components was derived as 
follows: 

In Labor Force Working (Incl. persons with a job but not at 
work) 


Agriculture 
Non-agriculture 


Employed 


Wants and looking for work 
Experienced 
New worker 


Wants but not looking for work (Believe no 
work available) 

Doing nothing 

Housewives 

Students 


Unemployed 


Wants work but not reported whether look- 
ing for work or not 


Not In Labor Force Wants but not looking for work for reasons 
other than the belief no work is available 
Experienced 


New workers 


) Not wanting work 
Housewives 

Students 

Disabled, retired, others 





Employment status unknown 


The October, 1956 questionnaire had some minor changes in format which 
were occasioned by the desire to secure more definitive response to the ques- 
tion; reasons for not seeking work. Also the questionnaire was simplified by the 
elimination of some demographic data, and the dropping of the inquiries re- 
lating to housing, home industry, agriculture, income, and expenditures. Minor 
changes in format and content were made in subsequent surveys in efforts to 
improve clarity and response. 
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ESTIMATES OF THE POPULATION 10 YEARS OLD AND OVER BY 
LABOR FORCE STATUS BY SEX FOR THE PHILIPPINES IN 
MAY, 1956 ARE SHOWN IN THE FOLLOWING TABLE* 





Per cent of total 
labor force 





Per cent of total 10 


Number (thousands) years old and over 


Labor force status 





Male \Female 


Total 


Male | Female 





Total persons 10 years old and over” 


In labor force 
Employed _ 
At work 


14,587 


| 7,463 


Total 


Male 


Female| 


Total 





100.0 


100.0 


100.0 


} 





9,497 
8,315 
7,885 


3,726 
2,988 


65.1 
57.0 
54.1 


| 
50.3 
40.4 
37.1 


100. 
87. 
83. 


430 
1,182 
459 


With a job but not at work 
Unemployed 
Worked before 
Never worked 723 
Not in labor force® 5,062 | 
Labor force status not reported 7 | 
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* Philippine Statistical Survey of Households Bulletin, May, 1956, Series No. 1, Vol. 1. 

> Persons living in households except for those living in embassies or consulate premises, foreigners visiting or 
traveling in the Philippines. Does not include personnel of the armed forces living in military installations, persons 
living in hotels, or aboard ships or i tes of institutions (penal, charitable, mental, hospitals, and the like). 

© Includes persons not wanting to work such as those keeping house, going to school, idle dependents, disabled, 
retired and others. 





Detailed tables on the survey have been published and are being extensively 
used by the agencies of the Philippine government. Much interest in the Philip- 
pine Statistical Survey of Households has been displayed by other countries 
of the ECAFE region. Few large periodic household sampling surveys are 
known to be in operation in Asia* and the creation of a similar mechanism in 
other countries could well be the answer to the need for basic statistical data. 
As the decade moves along to the point where most countries will be taking 
national censuses the establishment of concepts, definitions, programs and 
procedures becomes a certain requirement. The United Nations has taken the 
lead in this planning and such activities as the annual conference of Asian 
statisticians and frequent meetings of European and American statisticians 
are assembled to provide exchange facilities for the country technicians con- 
cerned. 

Prior to the survey some estimates of the unemployed ran as high as 3 million 
persons and as a consequence there was no reluctance on the part of the goverr- 
ment to accept the 1.2 million estimate advanced by the survey; the fact that 
the survey was taken during the school vacation period made the statistics even 
more palatable. The issuance of the estimates fortunately came at a time that 
the National Economic Council was preparing a 5 year plan of development 
and hence needed the figures provided by the survey. 

While much attention has been focused on the level of unemployment, the 
nature of the employed labor force must be closely scrutinized. In this connec- 
tion the revelation of underemployment is a matter of utmost concern to the 





* The primary responsibility for the sample design of the Philippine Statistical Survey of Households was 
carried by Satya B. Sen of the Indian Statistical Institute, who was assigned by the United Nations as Senior 
Statistician at the University of the Philippines Statistical Center. The use of replacement in sample selection is 
followed in the National Sample Survey of India on which Mr. Sen has worked for some years. K. C. 8S. Pillai of the 
United Nations Statistical Office succeeded Mr. Sen in January, 1956. 
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country’s planners. Serious mindea administrators recognized that an exami- 
nation of the employed population estimates did not give as rosy a picture as 
one would like. Some significant measure of underemployment may be indi- 
cated by the fact that 11 per cent of those reporting worked less than 20 hours 
while 21.7 per cent worked less than 30 hours. 

The bare reporting of hours worked and the use of these data as a basis of 
determining underemployment may establish a serious misunderstanding of the 
nature of the labor force and the prevailing practices of the Philippines. The 
close family ties of people whereby near and distant relatives live in the house- 
hold of an earner, but themselves do not work, seems to be an accepted part 
of the Philippine way of life. This family interdependence and its effect on the 
living levels of the people seems to be generally experienced throughout south- 
east Asia as indicated by the writer’s experience in Burma and other countries 
of the ECAFE area. 

The measurement of underemployment or even the satisfactory defining of 
such a condition then involves conceptual problems which are largely non- 
existent in «he west. Hours worked for an individual may prove to be an un- 
satisfactory criterion in the determination of effective employment. The use of 
unpaid family workers especially in the rural sector is certainly widespread and 
the fact that the number of hours worked is most difficult to obtain by inter- 
view tends to further obscure data utilized in the determination of optimum 
employment. The table below will illustrate the disproportionate number of 
workers employed less than 40 hours during the survey week. 


WORK STATUS AND WEEKLY HOURS WORKED OF PERSONS 
EMPLOYED IN AGRICULTURE* AND NON-AGRICULTURAL 
INDUSTRIES BY SEX, FOR THE PHILIPPINES: MAY, 1956 








Total | In agriculture | In non-agricultural industries 
Work status and weekly * j " 
hours worked 





Both 
Male Female | pane Male 


Both 
} 


Female one Male | Female 








1,328 3,268 1,608 1,660 





Number employed (thousands) 5,387 2,988 5,047 $,719 
Number at work (thousands) §,140 2,745 4, 8.625 1,261 | 2,999 1,615 1,484 
Total per cent 100.0 100.0 100.0 100.0 | 100.0 100.0 100.0 100.0 





ne 


4.9 6.0 
5.8 9.6 
7.6 15.0 
4.0 6.4 
4.8 5.5 
40.7 23.6 
17.4 13.2 
3.6 5.4 
11.1 14.4 
44.6 42.0 


—— 


Less than 10 hours a 5. 
10-19 hours a ‘ 9. 

4 

7 


- 
a 


20-29 hours 

30-34 hours 

35-39 hours 

40-48 hours 

49-56 hours 

57-64 hours 

65 hours and over 
Median Hours Worked 
Number with a job but not at | 


work (thousands) 430 | 161 94 


® Philippine Statistical Survey of Households Bulletin, May, 1956, Series No. 1, Vol. 1. 
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One proposal which has been recently advanced suggests that underemploy- 
ment be considered as a function of household income. This type of treatment 
would imply the acceptance of the family unit as a labor force unit with specific 
criteria of household-income-level classes set up for various sized family units. 
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Such data could be prepared for as many different kinds of economic, demo- 
graphic, and geographic categories as desired and as permitted by the sample 
size. 

In clearly defined urban and industrial employment areas the problem is 
considerably less and presumably the criterion of hours worked alone may serve 
to indicate level of underemployment. 

One obstacle to the use of income as a determinant of underemployment is 
the fact that good household income statistics have not been available. The 
survey taken in March, 1957 included comprehensive queries on income and 
expenditures and it is expected that the results will be published by the end of 
1957 or early in 1958. 


RELIABILITY OF THE SURVEY 


The coefficient of variation for the total population estimate was 3.46 per 
cent which was somewhat higher than originally expected. An intensive project 
to reduce the variances of the estimates in subsequent surveys was undertaken. 
The original selection of sample barrios without stratification by population 
size had been based on expediency since no authoritative population estimates 
existed for most barrios in the country since the 1948 census. Moreover the 
large number of changes in barrios boundaries, changes in names, changes in 
composition, and the elimination and creation of new barrios since 1948 pre- 
cluded original stratification of the barrios. Hence a sizable contribution to the 
sampling error can be accounted for by the irregular population size of the 
second stage units. 

As part of the field work connected with the October, 1956 survey, new 
estimates of population were secured by field personnel from local officials; in 
addition the locations of barrios were noted on a map. A plan was adopted 
whereby no barrio with less than 450 population would be taken as a sample 
unit. Original barrios were retained in all cases but if their population was less 
than 450 an adjacent barrio was added so that the sample areas were more 
uniformly sized. Additional listings were made and with the total population 
of the sample areas as the characteristic some test coefficients of variation were 
computed. The net result of this procedure was to reduce the coefficient of 
variation from the 3.46 per cent of the May, 1956 estimate to 2.44 per cent 
for the March, 1957 survey. No tests were made for other characteristics be- 
sides total population. At this writing preliminary computations have been 
completed for the May, 1957 survey and the coefficient of variation is estimated 
at 2.35 per cent. No data from the October, 1957 survey are yet available. 





TINBERGEN ON ECONOMIC POLICY* 


KENNETH J. ARROW 
Stanford University 


AN TINBERGEN’s study of economic policy deals with three levels: the econo- 
J metric framework, the personal experience derived as Director of the Cen- 

tral Plan Bureau of the Netherlands during the bulk of the post-war period, 
and a broad ethical outlook. The cold impersonality and attempted precision 
of econometric analysis, the vigor and promptness of decision necessary to the 
executive, and the warmth of a passion for social justice are the incongruous 
but indispensable elements of economic policy. Tinbergen is surely one of the 
few people who, by education, experience, and outlook, can bring all three sides 
into consideration though perhaps not always to resolve resulting contradic- 
tions. Tinbergen’s book is a necessary starting point for all those who are inter- 
ested in the formation of economic policy in situations where the state is con- 
sidered to play an active role. 

I will first follow Tinbergen’s order in dealing successively with his considera- 
tions on the aims of economic policy, quantitative aspects, qualitative aspects, 
and the execution of policy. I will then supplement this by some questions about 
the formation policy under consideration under conditions of uncertainty and, 
in particular, the role of the price system. 


1. THE AIMS OF ECONOMIC POLICY 


Tinbergen regards economic policy in principle as a maximization of a social 
welfare function subject to constraints imposed by technology, resources and 
to some extent political feasibility. Although aware of the difficulties in defining 
a social welfare function in a democratic society, he regards it as a sufficiently 
good approximation to take a consensus of the views of policy-makers. He sug- 
gests the following components as part of a general consensus on economic aims 
in modern times: (a) international peace; (b) maximum real expenditure per 
capita with full employment and monetary equilibrium; (c) distribution of in- 
come over social groups and countries; (d) emancipation of underprivileged 
groups, and (e) “as much personal freedom as compatible with the other aims.” 
While the ultimate purpose is maximization of the social welfare function, as 
a practical proposition this may be replaced in particular instances, in whole or 
in part, by prescribing fixed values for some of the variables in social welfare 
function and regarding them as targets. Most of the analysis is in fact conducted 
with regard to the achievement of fixed targets and as might be expected 
usually with reference to the aims subsumed under (b). 

There are some comments that might be made about the announced aims of 
policy. Most of them are used by Tinbergen only in reference to qualitative 
policy—that is, to changes in the form of organization of society. Actually, the 





* An invited review article on Jan Tinbergen, Economic Policy: Principles and Design. Amsterdam: North- 
Holland Publishing Company, 1956. Pp. xxviii, 276. 26.50 guilders or $7.00. 
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aim of peace, despite its prominence on the list, is referred to again only once 
in the book and then in a manner which can be described best as an obiter 
dictum. Clearly, peace affects economic well-being as it affects most other things 
but it seems to me that it can hardly be regarded as an aim of policy until it 
can be shown that economic policy affects peace. Of course, such effects do 
exist ; foreign economic aid and tariff policy are among the policies which have 
implications for peace. Whether or not the economist is in much of a position 
to advise on the consequences of such policies with respect to the achievement 
of peace is perhaps more open to question. 

The items under (b) are perhaps also interesting. Maximization of real ex- 
penditure is, of course, a classical—perhaps the classical—economic aim. How- 
ever, perhaps something must be said of its distribution over time; otherwise 
we have no valid ‘criteria for deciding between investment and consumption. 
The aims of full employment and of monetary equilibrium raise interesting 
questions. Are they, in fact, separate from the aim of maximum expenditure? 
The usual economic argument for full employment is that it is a necessary 
condition for maximum income. Tinbergen, however, seems to regard full em- 
ployment as an aim in itself because of the importance of work to the individual. 
While the difference of opinion here is perhaps interesting intellectually, it does 
not seem to lead to any important policy differences, so we need not pursue it. 

The assumption of the desirability of monetary equilibrium seems to me, in 
certain circumstances, somewhat more debatable as an end in itself. Tinbergen 
here, of course, is taking a thoroughly classical position. One can argue quite 
correctly that a stable price level is indeed one of the signals of the achieve- 
ments of equilibrium and therefore of an optimal allocation of resources. How- 
ever, it seems to me unnecessary and undesirable to put price stability as an 
end in itself. This can only be related to some idea that price instability—say 
inflation—has undesirable consequences in addition to its efficiency defects. 
One might be the undesirability of uncertainty, but from this point of view I 
think that stability of real income would be more important than stability of 
prices. Another would be, of course, the distributional effects of inflation or 
deflation, but then these should be studied directly, not hidden under a slogan 
of price stability. 

Aim (e) is rather opposed to some formulations of the classical liberal posi- 
tion though not to most modern practice. The issue of freedom of choice in 
economic affairs and particularly freedom of enterprise seems to have three 
components. One is the position that freedom leads to efficiency. This dees not 
create any special category for freedom as a value but makes it purely instru- 
mental to the achievement of maximum income. The second is the political 
view that interference with freedom of enterprise and economic choice will lead 
to interference with freedoms in other areas of life, such as political and social. 
This is a historical question which deserves careful study and to the best of 
my knowledge has not received it. Despite the obvious example of the Soviet 
Union, superficial examination does not suggest that there is an invariable cor- 
relation between economic and political freedom. The third argument is that 
freedom of choice is in itself a good thing somehow related to the virtues of 
individualistic self-reliance. This is again a matter in which a more explicit 
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debate than I have seen would be worth while. The issues, of course, being 
ethical, are not capable of completely objective solution. I have a good deal of 
sympathy for this last point of view but I think the matter is more complex 
than is sometimes supposed. Man is a social animal and the freedom in question 
must extend to organizations of various kinds, not excluding labor unions, for 
example. Such a broad concept of freedom means that there are internal con- 
tradictions and no simple view of freedom of individual choice will completely 
resolve the issue. 


2. QUANTITATIVE ECONOMIC POLICY 


In this section, Tinbergen deals with policiesdescribable as variation of numeri- 
cal magnitudes, such as government expenditures. Almost exclusively he as- 
sumes that there are fixed targets and does not actually discuss maximization 
problems. He starts with a complete model which is designed to explain a set 
of economic variables including the targets, in terms involving the instruments 
(policy variables) and other data of the system. The analytic problem is to 
solve for the targets and other variables the terms of the instruments; the policy 
problem is to fix the targets and solve for the instruments, eliminating the 
other variables in the process. If a linear model is assumed, it is immediately 
clear that the number of instruments must—except for special cases—be at 
least as great as the number of targets. This remark, extremely elementary in 
itself, is nevertheless highly illuminating in suggesting the way out of such post- 
war dilemmas as reconciling full employment with balance-of-payments 
equilibrium. 

A variety of models is studied. In each model, the number of instruments is 
equal to the number of targets. The first group have full employment as one 
target; monetary equilibrium is sometimes taken as a second target. Various 
combinations of instruments are considered in different models. Government 
expenditures appear in all but others include direct and indirect taxes, and 
wage rates. The simpler models are purely Keynesian in the sense that prices 
do not enter, but in more complicated ones both real and monetary quanities 
enter. In all cases, the problem is considered not only theoretically but also 
numerically, using plausible values of the coefficients derived from econometric 
work. In all cases, the models deai with short-run problems and lack dynamic 
elements. Tinbergen broadly concludes that as far as employment objectives 
are concerned, the main instruments are public expenditure and taxes which 
can be used in varying combinations to achieve the same results. Factor prices 
are very poor instruments for this purpose; even the direction of their effect is 
not clear. As far as monetary equilibrium is concerned, the wage rate is the 
preferred instrument; the interest rate is discarded as having only unimportant 
effects. Tinbergen indeed places a great deal of importance on control over the 
wage rate throughout his book; such control has indeed been a major feature 
of Dutch economic policy. Indeed, it seems to have been true that the basic 
negotiations about the general structure of wages have been greatly influenced 
by the calculations of the Central Plan Bureau as to the economic implications 
of the different possible wage rates. The very frequency with which Tinbergen 
recurs to this theme suggests a certain uneasiness on his part—not, I think, as 
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to the economic effectiveness of a control but as to possible interference with 
freedom. He argues that the usefulness of the wage rate in controlling price 
movements and also in controlling balance of payments problems is so great as 
to outweigh other considerations. Classical monetary policy is not considered 
seriously. Indeed, the possible effect of the stock of cash balances on the price 
level is nowhere considered seriously, although Tinbergen points out how poor 
our knowledge in this area is and urges further research. He does concur with 
the common judgment that restraints on credit may be effective in inflation but 
that supplying, of additional credits during a deflation will have little effect. 

He then proceeds to a study of policy in open economies. One of the more 
complicated models of this type is considered first with four targets: full em- 
ployment, balarice of payments equilibrium, a prescribed volume of investment, 
and a given distribution of income (between workers and others). There are 
four instruments: government expenditure, wage rate, indirect tax rate, and 
direct tax rate..The distribution of income is prescribed in the sense of a given 
ratio between labor and nonlabor income. This case is methodologically very 
interesting because despite the equality between the number of instruments and 
the number of targets, the problem is not, in general, capable of solution. It 
may be interesting to see how this conclusion is reached. First of all, foreign 
prices for imports are taken as given and may be used to measure all real mag- 
nitudes. Both imports and labor are assumed to be technologically determined 
by the volume of production. Hence the full employment target can perfectly 
determine the volume of production which in turn determines the volume of 
imports. Assume, for simplicity, that balance of payments equilibrium means 
a balance of imports and exports. Then the money value of exports is deter- 
mined. As exports are taken to be a function of prices, this determines the 
price level and consequently the real volume of exports. It is assumed that the 
price level depends upon the volume of production and the wage rate, which 
is an instrument. Hence the wage rate is now determined so as to achieve the 
given price level. The wage rate and the volume of employment together 
determine the wage bill. Nonlabor income is the difference between the product 
of the volume of production and the price level and the sum of the wage bili 
and the volume of imports. All of these have been determined so that both 
labor and nonlabor income are, in effect, determined by the one instrument— 
wage rate. It is impossible, therefore, to set a ratio between labor and nonlabor 
income as a target. 

One other model studied in connection with open economy is of special 
interest. Assume that the wage rate is taken as given. Then it is shown that 
an increase in productivity may, in quite realistic circumstances, lead to a de- 
terioration both in employment and the balance of payments. This rather 
startling result is due basically to the fact that the demand for exports is not 
infinitely elastic. However, the example given assumes an elasticity of demand 
of two, which is by no means unrealistically low. Of course it is true that a 
change in the wage rate would change the picture. In general, in regard to open 
economies Tinbergen stresses, even more than in the case of closed economies, 
the importance of the wage rate as an instrument. This is because of the effect 
on the price level and therefore on exports. 
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Some general remarks about the models may be made. In the first place, they 
tend on the whole to be rather simple macro-economic models. There is cer- 
tainly little differentiation among industries. Second, except in a few cases, the 
models are strictly short-run. In the exceptions, an attempt is made to pre- 
scribe the volume of investment or the stock of cash balances, in both cases for 
the presumed effect on the future on productivity and the price level, respec- 
tively. Third, again with minor exceptions, there is no maximization of some 
economic variable such as real output per capita, but only prescription of cer- 
tain targets. 


3. QUALITATIVE ECONOMIC POLICY 


Tinbergen begins, but perhaps does not push very far, the study of what he 
calls qualitative policy, where the institutions and methods of economic control 
are altered. He distinguishes between changes in the structure which still retain 
the basic foundations of the economic order and the more drastic reforms which 
involve changes in the foundations. Examples of the former are quantitative 
restrictions, built-in stabilizers and changes in the types of taxation, monopoly 
policy, administrative centralization or decentralization, and appraisal of in- 
vestment projects. The category of reforms includes social security schemes, 
guarantees of minimum income or minimum employment, equalization of op- 
portunities, monetary reforms, centralization of production decisions, industrial 
democracy, and nationalization. The discussion of most of these points is rather 
cursory though containing many wise remarks born of experience and obser- 
vation. 

He presents an appraisal of built-in stabilizers in a simple model of the busi- 
ness cycle, one which stresses the role of profits due to changing prices in in- 
fluencing spending (consumption or investment). He points out that one possi- 
ble government policy is to change the marginal propensity to spend by 
stabilizing its own pattern of expenditures or indeed by compensating for others 
and shows quickly how this will lead to a great reduction in the amplitude of 
the business cycle. 

The appraisal of investment projects relates mainly to such items of expendi- 
ture as roads or irrigation. Here he actually uses a maximization approach and, 
as perhaps might be expected, introduces shadow prices or—as he calls them— 
accounting prices to decide on the optimal allocation of resources. The analysis, 
though brief, is certainly excellent as far as it goes but Tinbergen has not, 
perhaps, faced all the difficulties that some of the modern writers on evaluation 
of benefits from public projects have had to face. 

In the discussion of reforms, perhaps the most interesting contribution is a 
new model of a distribution of income which seeks in a general way to relate 
it to the equalization of supply and demand and at the same time has some 
affinity with stochastic process models. This model is too complicated to be 
explained here, and it makes some striking assumptions. It would, however, 
take a considerable amount of work to reduce it to an operational form where 
it could be tested against data. 

In the analysis of monetary reforms, particular interest is paid to a raw 
material standard. He uses the business cycle model described previously to 
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show how the reduction in speculative opportunities would reduce the ampli- 
tude of the business cycle. However, he does not refer to the extensive literature 
on the practicability and effects of the raw material standard, such as the papers 
of Shaw [6] and Friedman [4].! 

Beyond the limits of reforms, Tinbergen discusses what he calls utopias. The 
discussion of the utopias is, however, very brief. 


4. ORGANIZATION AND EXECUTION OF ECONOMIC POLICY 


Tinbergen concludes his book by an exceedingly brief discussion of problems 
involved in organizing economic policy and in carrying it out. He suggests 
tentatively, but does not push very far, the idea that the organization of 
executive agencies should correspond in a natural way with the relation be- 
tween instruments and policies. This would be true, for example, if certain 
instruments were especially related to certain market variables. He stresses 
especially the need for coordination of decisions. Partly this would take the 
form of the making of fundamental decisions by a coordinating group, and 
partly that of instructing each agency to act on the basis of a specified common 
set of assumptions about the economic situation. Although Tinbergen expresses 
himself as being aware of the values of decentralization, the general trend of 
thinking is to emphasize the need for central control. Apart from other issues, 
this raises the serious question of the span of control possible to single officials 
or agencies. One suspects that in many cases centralization can be an illusion. 
Putting more responsibility on the central agency leads to an increase in the 
size of an agency and the difficulties of coordination within the agency replace 
those outside. It is very possible that the small-scale problems of the Dutch 
economy do not reflect all these difficulties, which perhaps would be more 
apparent in the United States or the Soviet Union. 

Tinbergen briefly but strongly emphasizes the importance of supervision of 
economic policy. This involves the question of data to check on the performance 
of the executive agencies. As he rightly stresses, such data are useful not only 
as control but as an aid in the termination of new policy. The importance of 
such supervision seems to me to be extremely important and in many respects 
given insufficient attention in the past. 


5. WELFARE ECONOMICS AND ECONOMIC POLICY 


Tinbergen, at an early stage in the book, discusses briefly and broadly ac- 
cepts the central proposition of welfare economics that under suitable hypothe- 
ses the competitive price system leads to an optimal resource allocation. He 
stresses among the qualifications that of full employment. Although the dis- 
cussion is not too explicit, one may argue that Tinbergen takes economic policy 
to be the aim of repairing defects in a price system. This point of view would 
in many respects justify the fact that Tinbergen’s models, for the most part, 
confine themselves to achievement of specific targets such as full employment 
and monetary equilibrium, rather than the maximization of real output per 





1 At another point in discussing monetary reforms. Tinbergen uses the term “one hundred per cent money” 
to refer to a one hundred per cent gold cover for money. Possibly this terminology is used in Europe, but in this 
country the proposals of Irving Fisher and Henry Simons for one hundred per cent money have a very different 
meaning—one which, incidentally, could well be studied along the lines of Tinbergen’s type of analysis. 
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capita. One can say that he feels that a government policy can be confined to 
the former ends because the latter is being taken care of by the private sector. 

Tinbergen does not take much account of the two classic justifications of 
welfare economics for government interference: economies of scale and the 
absence of markets to mediate external effects in certain areas. The former 
leads to the questions of natural monopolies which may be met by nationaliza- 
tion. This leads into the whole area of products of nationalized enterprise which 
has occupied a prominent role in the theoretical literature of welfare economics 
and some, particularly among the French writers such as Massé, Boiteux, and 
Allais, on the practical side. This problem does not seem to have been raised 
in Tinbergen’s work in the Netherlands. 

The question of external economies is, in my opinion, mainly of importance 
in regard to investment. The classic point is that there is an inadequate number 
of future markets to achieve the allocation of resources over time that is re- 
quired by the theory of resource allocation. This is the basic reason why some 
justification can be given for government planning in economic development 
(see, e.g., Chenery [2]). The argument is not, of course, a conclusive one. It 
depends on whether the possibilities of coordination of knowledge by the gov- 
ernment will produce anticipations of the future which are better than the price 
anticipations of individual entrepreneurs. It would seem, however, that one of 
the primary aims of planning must involve some consideration for the future 
which will take the form of determining an optimal value for investment as 
well as an optimal distribution of it. The possibility of using the theory of the 
optimal allocation of resources as a practical device for such forward planning 
is only beginning to be studied. Tinbergen, indeed, in a discussion of the ap- 
praisal of investment projects touches upon a special case of this and outlines 
very satisfactorily the general approach. 

Welfare economics perhaps could be used in another way in connection 
with planning, this time on the negative side. It would suggest that some instru- 
ments not be used because of the resulting inefficiency in the use of resources. 
In particular, fixing of the wage rate would seem to involve at least some of 
the difficulties usually faced in price fixing of any type. This is in addition to 
the problem of how the macro-economic determination of an average wage 
rate is actually translated into the determination of wage rates in the countless 
bargains of actual industry. It may well be that the attempt to make individual 
wage rates conform to some national average will not only interfere with the 
flexibility of the national wage rates but also with the flexibility of relative 
wages and so interfere with the allocation of labor among industries. The use 
of taxation, particularly indirect taxes, has well-known drawbacks, though it 
should be added that these are taken account of by Tinbergen informally rather 
than through his models. Also government expenditure appears mainly as a 
regulator of the level of employment. Of course, when there are unemployed 
resources, one may argue that the marginal social cost is zero and therefore 
there is no particular objection to the use of government expenditures; still it 
would be desirable to have some theoretical treatment of the allocation of gov- 
ernment expenditures both as to type and as to the choice between private and 
public control of expenditures. 
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6. ECONOMIC POLICY AND UNCERTAINTY 


Tinbergen in no way denies the fact that effects of economic policy are cer- 
tainly far from completely predictable. We find that economic policies are sub- 
ject to change and as a matter of standard practice are reviewed annually. 
These two facts put economic policy in the category of sequential decision- 
making or dynamic programming, like inventory theory or sequential analysis 
of statistical data. Such a situation calls for a flexible policy—that is to say, 
one in which the policy decisions each year are to be a function of all that has 
happened in the previous years. Furthermore, in making up a policy for the 
future, one should be aware of the fact that the future policy will be similarly 
flexible and that there will be opportunities for self-corrective measures. Indeed, 
this property of self-correction is of course one of the most important attributes 
of the price system and any planning mechanism which seems to substitute 
for part of it must have the same attributes. The review exists operationally ; 
a new model is constructed each year, but the theory does not take account of 
this review. 

It is the fact that new information is accumulated over time that makes 
policies based exclusively on built-in stabilizers less than optimal in principle. 
The action taken should always reflect in some way the latest information 
available. It is already clear from inventory theory that a policy which takes 
the future possibilities of learning into account will be qualitatively different 
from one which ignores them. The government must consider itself as a response 
mechanism which reacts to disturbances from optimal performance in such a 
way as to counteract them. This point of view has several policy implications. 
First, there will usually be a benefit derived from very fast reactions. To 
achieve these, it will be necessary usually to speed up the flow of information; 
it may also be necessary to act on the basis of such information as is quickly 
available. Second, there is a value to reserves, e.g., of foreign exchange, to 
buffer the impact of changes in external conditions, especially those of a random 
nature. Third, the analysis of the model should lead to associating an economic 
index (a composite of the economic variables) with each instrument, so that the 
instrument is varied in accordance with the movements of the index. If govern- 
ment expenditure is associated only with employment, for example, then the 
government should arrive at a reaction function between the two; but a more 
general analysis would suggest perhaps associating government expenditure 
with an index which incorporates both employment and the price level. 


7. THE WORKINGS OF ECONOMIC PLANNING 


We need much more knowledge on the actual workings of the process of 
economic planning, its administrative and political aspects, and its effect on 
the economy. Devons [3] has given us a rather discouraging picture of war- 
time economic planning, as seen from the inside, but the situation is hardly 
typical. There have, to the best of my knowledge, been no adequate studies of 
the actual day-by-day methods by which plans are made and carried out. 
Marris [5] has given us some insight into the Dutch experience, and there is an 
excellent (though by now dated) paper by Bjerve [1] on the Norwegian ex- 
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perience. It is very much to be hoped that Tinbergen or one of his colleagues 
will write a careful account of Netherlands planning. What are the thoughts 
and aims of the planners? How do they deal with other government officials? 
To what extent are actual government policies a reflection of the plan and to 
what extent is the behavior of the economy the result of government policies? 

Finally, it must be asked to what extent has the influence of the plan on the 
economy been in the direction of its intentions? For example, to what extent is 
the rather considerable success of the Dutch and Norwegian recovery after the 
war been due to the highly self-conscious, intelligent work of the planning 
bureaus? Some studies have been made of the forecasting ability of the planners 
(see Theil [7]), but this is not the same as the success of the plan itself. Sup- 
pose, for example, that the forecasts of the government are no better than those 
made from a naive model which asserts that the rate of increase of national 
income is constant. This does not disprove the ability to plan; it may be that 
the planning has been so successful as to remove the cyclical fluctuations which 
would cause the naive model to fail in forecasting. There is need for ‘consider- 
able research in criteria for the evaluation of planning: 
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ON THE RELATIVE ACCURACY OF SOME 
SAMPLING TECHNIQUES 


Des Ras 
American University of Beirut 


The problem considered is estimation of the mean value of a character 
for a finite population, making use of information on an auxiliary 
character. A comparison is made between the pps estimate and the 
simple average, ratio and regression and stratified sample estimates. 
The results are obtained under the assumption that the finite population 
is a random sample from an infinite population possessing certain 


properties. 





1, INTRODUCTION 





N sampling designs the use of supplementary information is known to im- 
] prove considerably the accuracy of the estimates. But there are generally 
several ways of making use of auxiliary information. It then becomes important 
to sti:cy the relative accuracy of sampling designs using such information. 
Such vestigations, however, become difficult when we are concerned with 
finite populations where no functional form of the distribution followed by the 
data is assumed, as is the practice in sample surveys. One way out of this dif- 
ficulty, as suggested by Cochran [1], is to regard the finite population as drawn 
at random from an infinite super-population which possesses certain properties. 
The results obtained do not apply to any single finite population but to the 
average of all finite populations that can be drawn from the infinite population. 
This approach has been used in this paper to study the relative accuracy of 
some sampling designs. Some results in this direction were obtained by the 
author in Des Raj [2]. 


2. COMPARISON OF pps AND EQUAL PROBABILITY ESTIMATES 
Let there be a finite population containing the N units 
Ui, U2, aa ei, Uy. (1) 


We want to estimate for this population the mean Y=1/N }-y; for a character 
y. We have available with us the values 


Xi, X2,---, Xw (2) 
of the N units for an auxiliary character z. The variances of the pps and equal 
probability estimates (where sampling is with replacement) are 

1 


ae abs re - vr], (3) 





aan - (4) 


where X and Y are the totals of the characters z and y for the finite population. 
98 
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We shall now assume that, for the character y, the finite population given by 
(1) is a random sample from an infinite population. The model assumed is that 








yi = Bri +e; (5) 
where 
E(e;|z) =0, Vie|2) =arv, g>O. (6) 
It is easy to see that, under this model 
a 
E(V yy.) = —~- [x > ee = ) x], (7) 
nN? 
-1 
E(Vq) = [a Do zv + 6*No,!] (8) 
nN? 
where 
1 
“" = fae X 3, 
ef « ——~ bee = F) (9) 


Thus the pps estimate is superior to the equal probability estimate if 
2 
X¥ Dire —- Vire < (N - 1) A o,? (10) 
a 
or 


N-1 ® a, 


Az, 2°) > — 
, N a os 





(11) 


where p(z, x*—') is the correlation coefficient between x and x*-!. Hence for g>1, 
the pps estimate is always superior. However, for 0<g <1 it will be so only if 
inequality (11) is satisfied. In the few studies that have reported data exhibiting 
the relationship between V(e,;) and 2z;, it is found that g lies between 1 and 2 so 
that in this situation the pps estimate is more precise than the one which does 
not make use of the auxiliary information. 


3. COMPARISON OF RATIO AND REGRESSION 
ESTIMATES WITH THE pps ESTIMATE 


In order to obtain usable results, we shall assume that g=0 in the mathe- 
matical model. This means that the variance of y’s in arrays determined by z 
is independent of xz. We shall also assume that 


Yi r= at Pate (12) 


so that the regression line need not be through the origin. Then 


E(V = */% ‘|+<[3 1| a3 
"= ntX WN n_X ) 


where X is the harmonic mean of z. 
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Using the large sample formula for the variance of the ratio estimate, we 
have 








Vout = — = Dd (yi — Rei)? 
so that 
E(V re) = <\* =p (< + =) ce (14) 
n N a 
where 


C,? = o,2/X?. 


Thus the ratio estimate is superior to the pps estimate if 


>i ter{1 “—(1+=)"] (15) 
xX N a ‘ 


In case a=0, the condition is 


. >1+ : Ct (16) 
x N 
Now C,?/N will be very small for large populations, so that inequality (16) 


will be found to hold fairly generally. 
With regard to the regression estimate, the large sample variance is given by 








‘oo 
Viawt— lem «9 17 
0 (. 5) ot p*) ( ) 
and 
pied lis tale (18) 
Oh, Etna eaiiinae 


Thus the regression estimate is superior to the pps estimate if 


7+=(5 sta “( N-2 : - 
tats ee Et oa par ). (19) 


Now the right hand side of the inequality is less than unity and the left hand 
side is greater than unity so that this inequality holds. Hence, under this model, 
the regression estimate is superior. 








4. STRATIFICATION AND pps SAMPLING 


We shall consider the case when the entire population is divided up into k 
strata and the size of a unit in the ith stratum is 2;. Let y;;=a+62z,;+e,; where 
E(e;;| z:) =0, V(exs| z;)=azx,. Then the variance of the stratified sample esti- 
mate is given by 











RELATIVE ACCURACY OF SAMPLING TECHNIQUES 101 





1 1 1 
Vu = aq 2 Need E _ +) = 
and 
E(Va) = — ey N?x# (— - : ). (21) 
N? i=l ni N; 


The expected values of the variances for proportionate and optimum allocations 
are given respectively by 








E(V pu) = — ee 2f —- — + x‘, (22) 
nN jini N? (21 
E(V — > xg 23 
one w? ri". ( ) 


The expected value of the variance for the pps estimate is 
E(Vpp:) = eX Dae — > ze]. 
Thus the pps estimate is superior to the stratified (proportionate) estimate if 
n—1 pu 
(1 ad ) > re > XY xe. (24) 


If (n—1)/N is negligibly small, the condition reduces to 
p(x, z-") > 0. (25) 





Hence for g>1, the pps estimate is superior to the stratified (proportionate) 
estimate, while it is inferior for 0<g<1. Again, the stratified (optimum) esti- 
mate is superior to the pps estimate if 


(D2) re) + (n- 1) YD ze > (E ze!?)*. (26) 
Now, by Cauchy’s inequality we have 
(Do m)( Qo ze?) > (Lo ze)? (27) 


so that inequality (26) holds for all g. Hence, for g>1, the pps estimate is 
superior to the stratified (proportionate) estimate, but is inferior to the strati- 
fied (optimum) estimate. 
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PROBABILISTIC INTERPRETATIONS FOR THE MEAN 
SQUARE CONTINGENCY 


H. M. Buauock, Jr. 
University of Michigan 
The purpose of this paper is to interpret ¢*, the mean square con- 
tingency, as a function of probabilities associated with proportional 
prediction and in so doing to provide some additional evidence on the 


difficulty of giving useful operational interpretations to measures of 
association based on chi-square. 


oopMAN and Kruskal [2] point out that measures involving simple func- 

tions of chi-square have not been given operational interpretation and 
that therefore it is difficult to compare meaningfully two different cross- 
classifications with respect to the values of measures such as ¢’, C, or Tschu- 
prow’s T. These authors give an interpretation for ¢? in terms of loss functions 
({2], 755-6) as well as probabilistic interpretations for some less well-known 
measures. The latter interpretations involve the relative probabilities of making 
an error in assigning individuals to a particular category of one variable, either 
knowing or not knowing the category of the second variable to which the 
individual belongs. The interpretations for ¢? presented in this paper are essen- 
tially very similar to probabilistic interpretations given in [2] to other measures 
of association and permit a direct comparison between ¢* and one of these 
measures, 7». 


INTERPRETATIONS FOR ¢° 


Following Goodman and Kruskal we confine our attention to populations 
and make use of the following notation: 


B, Bz ---+ Bs | Total 
Ai Pils Piz * * * pig Pl. 





Az | pn pn-** pap Pr. 


Ae | Pet Per’ ** Pep | Po 
Total! ps p2::*ps 1 











where the ¢’s represent proportions of the total population. Therefore, for a 
finite population of size N the number of cases in any given cell (a, b) can be 
represented by Noa». 

In this notation chi-square is 





vended (pab — Pa.p.r)* 
a b Pa .P .b 
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It can easily be shown algebraically that 


_ Pab9ar 
x? ‘ 
bute Sa-14-f = 
N b b P.b 
where 0,; = 1—pas/pe., i.e. the proportion of p,, not in column B,.! 

Expressed in this form, ¢? lends itself to interpretations involving a propor- 
tional prediction model in which one compares the probabilities of errors made 
in assigning an individual to the proper B-class with and without knowledge 
of his A-class.2 We imagine that a person makes a large number of trials in 
which he attempts to assign individuals, chosen at random from the population, 
to the correct B-class subject to the restriction that he must place the proper 
proportion of individuals in each B-class. In other words, if he is given no 
information about the A-class he must assign the proportion p., of the cases to 
category B,, and if he knows the A-class he must assign pas/pa, of the cases in 
class A, to category By. Stated in terms of probabilities, he will select category 
B, with a probability of p., if he has no further information, but given that the 
individual is in A, he will choose category B, with probability pas/pa..* 

There are several ways of expressing ¢’ in terms of probabilities. Let us first 
define P(E,) as the conditional probability of making an error in assigning an 
individual drawn randomly from the population, given that the individual was 
assigned to column B,, and given that the method of assignment did not make 
use of information about the individual’s A-class. Clearly 


P(E,) = 1 — po. 


Also, let P4(E») be the conditional probability of making an error, given that 
the individual was assigned to column B,, and given that the method of assign- 
ment did make use of information about the individual’s A-class. Then 


Expected no. of errors in B, given A-class 
4 (Ep) _ 





Total no. of cases in B, 


} Npav(l — par/pa.) Dd pasOas 


Np» P.b 





An alternative but equivalent set of definitions can be given to P(E.) and 
P4(E,), providing a second interpretation for ¢*. Let P(E) be the conditional 
probability of making an error in assigning an individual drawn randomly, 
given that the individual was in fact in column B,, and given that the method 
of assignment did not make use of information about the individual’s A-class. 
Similarly, P4(£») is the conditional probability of erroneous assignment, given 





! This can be seen by noting the contribution to chi square of each cell, adding the contributions of the cells in 
each column and simplifying, and then summing over b. 

2 By symmetry ¢? can also be interpreted in terms of predictions of A-classes with and without knowledge of B. 

3 See Goodman and Kruskal ([2], 759) for a more formal discussion of proportional prediction. A distinction is 
made in [2] between proportional prediction and optimal prediction in which there are no restrictions placed on 
the relative number of times each category can be selected. Under optimal prediction a person always selects the 
By with the largest marginal if he does not know A and the cell with the largest pap given Ag. 





Aa ty 
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that the individual was in fact in B,, and given that the method of assignment 
did make use of information about the individual’s A-class.‘ 
Since 


> P(E) = DY (1 - px») = 8-1 
b b 





and 
ys Pad9as 
ZF AR).© 26. 
b b P.b 
we have ' 
¢* = X P(E) — re Pa(Es) 
and 
> P(Es) — 2) Pa(Es) § 
¢? b b 
p-1 > Pe) 


Therefore, under the proportional prediction model ¢? can be interpreted as 
the difference between the sum of the probabilities of error not knowing A and 
the sum of the probabilities of error given the A-class. Thus ¢? is a function of 
probabilities each of which can be given simple interpretations. There are cer- 
tain difficulties apparent when one attempts to give intuitive meaning to such 
a function of probabilities, however. The sums involved in the expression for 
¢* are sums of conditional probabilities under different conditions, and therefore 
these sums are not in themselves probabilities nor are they <1.° 


COMPARISON WITH 7), 


The inadequacy of this kind of interpretation for ¢* can be made more appar- 
ent if we compare ¢ with Goodman and Kruskal’s 7, which is defined as 
follows: 


D> Dd pas?/pa. — D> ps? 
a b b 
“ty Bes ; p.»* 
b 


As indicated above, the interpretation for ¢ involves a summation of condi- 
tional probabilities under differing conditions. Legitimate probabilities could 








4 It can be seen that the two sets of definitions are equivalent. If the individual were actually in By the prob- 
ability of assignment to some other column is 1—p» given no knowledge of the A-class, and similarly it is 
Zeatfad/P.b given knowledge of the proper A-class. 


5 Cramer ([1), 282) shows that ¢?<Min (a—1, 6—1), from which it follows that ¢?/8—1 may reach an upper 
limit of unity if 8 <a and if certain conditions on the marginal totals obtain. If a <8 and if all p.4>0 it will be im- 
possible to reduce to sero the probability of error in assigning individuals to B-classes. 

* By dividing each sum by 8 we can obtain arithmetic means of numbers each of which, under the second set 
of definitions, is a conditional probability of erroneous assignment given that an individual was actually in a par- 
ticular By. A simple arithmetic mean would seem to make sense, however, only if all p., were equal or if loss functions 
were attached to columns in such a manner as exactly to counterbalance unequal marginals. 
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have been obtained by multiplying each conditional probability by the proper 
p.» and then summing. Had these unconditional probabilities been summed we 
would have obtained 


Dd pw(l — ps) =1— DY pa? 
b b 


and 


p Par(l — Pab/ Pa.) 
Dp.» * 3 1— D0 Dd por*/pa. 


b P.b 





Subtracting the second probability from the first and dividing by the first we 
obtain precisely 7,. Therefore, r, can be interpreted as the relative decrease in 
the proportion of incorrect predictions over the whole table when information 
about A is used. 

Thus the interpretation of 7, is considerably more straightforward than that 
of ¢*. We see that 7, gives the proper weight to the conditional probabilities, 
whereas in the interpretation for ¢’ they are all given equal weight regardless 
of the size of the column totals. The arbitrary nature of ¢? becomes more obvi- 
ous when we notice that the smaller the p.» the larger the conditional probability 
of erroneous assignment. Therefore, the largest contributions to 2P(Ey) come 
from the columns with the smallest marginals.” 


CONCLUSION 


Although it is possible to give interpretations to ¢* which involve simple 
functions of probabilities, these same interpretations help to indicate the arbi- 
trary nature of the measure. Therefore it appears that a measure such as 7; is 
generally to be preferred ever ¢? or other measures based on chi-square. 





7 An interpretation for ¢* in terms of a weighted average of the ratios of explained to total variances leads to 
essentially the same conclusion. The weights turn out to be proportional to 1—p.», giving greatest weight to the 
smallest marginals. 





i 





FITTING STRAIGHT LINES WHEN ONE 
VARIABLE IS CONTROLLED* 


Henry Scuerré 
University of California, Berkeley 


Experiments are analyzed in which “runs” are made on the appara- 
tus, the data are bivariate, observations on one variable are taken for 
a predetermined set of values for the observations on the other variable, 
and straight lines are fitted on each run to the observed points. The 
possibility is considered that the “true” line may vary from run to run 
because of uncontrolled factors causing differences between runs. The 
mathematical model generalizes one introduced by Berkson. 


I. INTRODUCTION 


ables, usually in the physical sciences and frequently in some other 
sciences, a set of values is chosen for one of the variables, this variable is brought 
to the chosen values—more precisely, the variable is adjusted so that the ob- 
servations on this variable equal the chosen values—and the corresponding 
values of the other variable are measured. We shall denote the first, or con- 
trolled, variable by z, and the second, or dependent variable by y. The important 
distinction between this case and another in which the observed pairs (z, y) 


Wa experiments are made to investigate the relation between two vari- 


are sampled from a single bivariate distribution was treated in a basic paper 
by Berkson [2]. The present paper is limited to applications where straight 
lines are fitted to the data. It employs a mathematical model which the writer 
believes to be widely useful, and which is a modification and generalization of 
Berkson’s model.' 

Three types of problems commonly arise, depending on whether the char- 
acteristic of primary interest on a fitted line is 


(i) the slope, or its reciprocal, 
(ii) the ordinate at a given x=2o, or 
(iii) the abscissa at a given y= yo. 


Solutions of such problems useful for many practical purposes can be based on 
formulas which give (a) the observed characteristic (i), (ii), or (iii) as an esti- 





* This paper was prepared with the partial support of the Office of Naval Research. This paper in whole or in 
part may be reproduced for any purpose of the United States Government. 

1 A brief indication of the overlap of this paper with the work of some other writers who have treated con- 
trolled variables subject to error may be of interest: Lindley [5] intends as I do in sec. II to “give a mathematical 
justification of Berkson's procedure,” but uses a notation such that questions about the model like the one I discuss 
in connection with (13) do not emerge. Geary [4] says that “a theory of estimation and of tests of significance is 
sketched for the case of controlled experiments when the inh t two variables is non-lixear,” 
but actually he extends Berkson’s model to the case where the true value of y is a cubic polynomial in the true 
value of z, and attempts a statistical analysis under this model. T. W. Anderson [1] considers two different but 
related models applicable respectively to experimentation with and commercial operation of an industrial process. 
The latter model resembles Berkson's with some extra terms in the model equation for the dependent variable, for 
dependence on additional controlled variables not subject to error. The coefficients in this equation are functi 
of those in bivariate regression equaticns for the former model where the coefficients can be estimated by standard 
least squares methods. The problems solved concern prediction under the latter model from data assumed to con- 
form to the former. None of these writers treat the coefficients in the equation relating the true values of y to the 
true values of z as random variables, as I do following sec. II. 
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mate (i.e., unbiased estimate) of some conceptual quantity in an appropriate 
mathematical model, (b) the variance of this estimate, and (c) an estimate of 
this variance. We give such formulas for two mathematical models, one of 
which is a simple special case of the other, and statistical tests for whether the 
simpler model fits. 

We shall speak of making “runs” in the experiment, and we shall assume that 
for each run there is a conceptual line to be defined below, and which will be 
called the “true” line for the run. While this true line is assumed to remain 
fixed during a run, we allow the possibility that to another run there may corre- 
spond a different true line. Thus, in the petroleum refining field, in a laboratory 
experiment for the comparison of different catalysts, if suitable variables are 
used, the observed points in a single run may seem to follow a line quite closely, 
and we may be willing to assume there exists a true line. However, on another 
run the true line may be different: This difference might be caused by the use 
of different catalysts in the two runs. However, we also conceive of a possible 
difference of true lines even if the same catalyst is used and it is attempted to 
maintain the same conditions, because of possible uncontrolled factors causing 
the runs to differ. 


II. CASE OF A SINGLE RUN 


We consider in this section only what happens during a single run. This 
might be the only run, or the ith of a set of runs, made under the same con- 
ditions. It will be convenient in the notation to adjoin a subscript 7, which will 
make the results useful in the latter case and which can be discarded in the 
former. 

If J; points are observed during the run (J;22), the values chosen for the ob- 
servations on the controlled variable z will be denoted by 2;; (j=1,2, ---,J43), 
and the corresponding values observed for the dependent variable y will be 
denoted by y;;. We assume that there exist true values (X,;, Y;;) of the variables 
from which the observed values (2;;, y;;) differ by random errors (u;;, v;;), 


tig = Xj + uss, (1) 

ys = Ves + 055, (2) 
and we assume that the true values satisfy a linear relation in the experimental 
range, 

Yas = a; + 0X5, (3) 
where the coefficients a; and b; remain fixed during the run. We shall call the 
line 

y =a, + dx (4) 


on which the true values lie, the true iine for the ith run. 

In (1) the error u,; is a random variable but the observation 2;; is not, since 
it is forced to be equal to a predetermined value. Hence the true value X;,; is a 
random variable because of (1), and the true value Y;; is also a random variable 
because of (3). We remark that while the model is notationally similar to cer- 
tain others, used for example in economics, involving a linear relation and 
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errors in both variables, the distribution assumptions are very different: in- 
stead of the error u,; and the true value X;; being uncorrelated their correlation 
coefficient is —1. 

If we substitute (1) and (2) into (3) we get 


Yy = ay + Dari; + e345, (5) 

where 
Ci = Vj — Duy. (6) 
The relation (5) is of the form assumed in the theory of least squares, when a 
line is fitted to the data by minimizing the sum of the squares of the distances 
of the points (z,;, y;;) from the fitted line and the distances are measured paral- 
lel to the y-axis, that is, perpendicular to the axis of the controlled variable. We 
shall assume a line is fitted this way no matter which of the above three types 
of problem concerns us. If the fitted line is denoted by 
y= 4;+ bx , 

then the coefficients 4; and 6; are determined by minimizing? 


p> (sj - &— b.2x;;)*, 


and are given by the formulas 


b; = \ (255 — 23.) ¥si/Si, 


>] 


4; = yi. - bix:., 


S; = > (ty — %.)?, 


t= >) ty/Ji, ys. = D yii/Ji 
i i 


On substituting (5) into (7) we get 
6 = b + SH DO (x5 — tides 
j 


From (5) we find 
¥i = a + Oa. + eG, 

where e;.= )_; e;;/J,, and substituting this and (9) into (8), we have 

4,=a:te,, — a8o* Do (ty — aes. (10) 

i 

From (9) and (10) together with (6), we see that regardless of the joint dis- 
tribution of the 2./; errors { uss} and {v5}, providing only that all the errors © 
have zero means, the coefficients 4; and 6, of the line fitted in this way are un- 


biased estimates of the coefficients a; and b; of the true line (4): this is Berkson’s 
result. We shall write this result as follows: 





1 The index of summation j everywhere runs from 1 to Jj, the index é (in Parts III-VII) from 1 to I. 
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E(u) = 0, E(v,;) = 0 


implies 


E(4;| t) = a, 
E(5; | t) = bi; 


(12) 


the notation in (12) is an abbreviated conditional expectation notation, “| i” 
meaning “given the values of a; and b;.” To justify the use of the conditional 
notation for means, variances, and covariances in this section we mention here 
that when we later consider what happens when more than one run is made or 
imagined we shall be especially interested in two cases: 


(I) the values of the coefficients a; and b; of the true line are the same on 
every run, and 

(II) the values of a; and 6; are values taken on in the 7th run by random 
variables (a, b). 


In case (I) the conditional notation is a formality and the values denoted with 
it are actually unconditional, but the notation permits covering both cases 
with a single set of formulas. 

We next assume in addition to (11) that the errors {ws} and { vis} are inde- 
pendently distributed, that the {u;;} have equal variances, likewise the {»,;}, 
and we let 


ou = Var (uj), 90” = Var (0)). (13) 


Our model now differs somewhat from Berkson’s, for our assumptions imply 
that if a subset of the measurements is made with the same observed value 
for the controlled variable, say x;;=2’ in this subset, then the corresponding 
errors {u;;} are statistically independent, whereas one of his equations* implies 
that he assumes the {u,;} for this subset will be identical but the {v,;} inde- 
pendent (or at least uncorrelated). Which model is appropriate depends on 
how the replicated measurements are made: We suppose that the repeated 
measurements are made by changing the controlled variable from its previous 
value when the observation was x’ and then bringing it back so that the ob- 
servation is again x’, while he apparently supposes the controlled variable is 
not changed. We both assume new observations on the dependent variable y 
in the repeated measurements. Thus if y is measured by sampling a flow and 
having a chemical analysis made for y, we both suppose a new sample with a 
new chemical analysis on each repeated measurement. We remark that if the- 
error in the controlled variable is caused by a bias in the instruments measuring 
it (such as would be removed by use of a calibration curve), then while it is 
true that the error u;; would be the same on each of the replicated measure- 
ments, Berkson’s model would not fit either, since he also assumes E(u;;) =0. 
From (6), 


Var (e4;| i) = 07.4, 





3 His equation (16) in [2]. 
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where 
o*..¢ = op? + b,%e,?. (14) 

The usual least squares theory tells us that 

Var (4;| i) = os.(Ji7? + Siz:.%), 

Var (5;| i) = o°,.:/S,, 

Cov (4;, b, | t) = — o*, «t;./S8i, 

and that if J;>2, 


4 = > (yis — 4; — bxi;)?/(Js — 2) (16) 


is an unbiased estimate of o?,,; in the sense that 

E(é?,,;| t) = oi. (17) 
If we add the normality assumption, i.e., that the errors { wis} and {v,;} are 
normal, the usual confidence intervals or confidence ellipsoid are of course 


available for a; and },. 
If we denote by n; the ordinate on the true line (4) at a given r= 2», 


ni = a; + dio, 
and by 7; the ordinate on the fitted line at r=2p, 

ix = 4; + bere, 
we find 

E(is| te) = 08, (18) 

Var (7; | i) = o*, «(J + Sx — x;.)*]. (19) 
Let us denote by &£; the abscissa on the true line (4) at a given y=yo, 
= (yo — a;)/d;. 


While an exact confidence set is available for £; under the normality assumption, 
we shall instead use some simpler approximations. Let 


E: = (yo — 4,)/b;, 


so £; is the abscissa on the fitted line at y= yo. Then if Var (4;| 7), given by (15), 
is small compared with b,?, we have the approximations‘ 





‘ The interpretation of (20), (21), and similar approximations obtained from the commonly used formulas 
(22) and (23), is a little delicate: The means and variances symbolized on the left of (22) and (23) may not exist. 
Indeed, this is the case with e; in the above application if a; and 5; have a nondegenerate bivariate normal distribu- 
tion. In the notation of (22) and (23), if a, 8 have means a, b, and finite variances, and if the function z =/(2, 5) has 
a tangent plane at (4, }) =(a, b), then the expressions on the right of (22) and (23) are the mean and variance of a 
random variable approximately equa! to Sia, b), the approximating random variable being the linear function of 
&, § corresponding to the tangent plane to the surface z =f(a, 5) at (a, b). The reason that in the above application 
I add the condition that Var (b;| i) be small compared with ;? is that in that application the function f(a, b) becomes 
infinite along the line 6 =0, I want a small probability of the point (2, }) falling near or beyond this line where the 
linear approximation is so bad, and this probability will be small by the Tchebycheff inequality if |b/o;| is large. 
If f(a, b) does not have finite variance but is asymptotically normal (in J, where a and and depend on J), then the 
mean and variance of the asymptotic normal distribution are ordinarily given by the right sides of (22) and (23). 
In the above application to £;, (a, 6) can be shown to be thus asymptotically normal under mild assumptions, by 
using methods of Cramér [3], sec. 28.4. 
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E(é| i) ~ &, 
Var (€;| 1) ~ obs? [J + S(x;. — &)*], 


obtained from the formulas 


E(f(4, 6)) ~ f(a, b), 
Var (f(4, 6)) ~ f.? Var (4) + fo? Var (6) + 2fafs Cov (4, 5), 
where a= E(4), b=E(5), fa=af(a, b)/da, and f,=af(a, b)/ab. 


III. CASE OF SEVERAL RUNS; TRUE LINE POSSIBLY VARYING 


We consider now case (II) mentioned above, in which the values of a; and 
b; are the values taken on in the ith run by random variables (a, 6), so that for 
different 7 the pairs (a;, b;) are independently and identically distributed, and 
we let 


a= E(a,), B= E(b,), 


(24) 
o,”? = Var (a;), o.? = Var (b:), oa, = Cov (ai, By). 


In addition to the previous assumptions on the errors {u;,} and {v,,} stated 
in connection with (13) and now made for all i, we assume further that they 
are statistically independent of the { (ai, b,) i. We note that case (I) above is 
the special case of (II) in which o,?=0 and o,?=0. 

We may define a mean true line 


y =a + 82, 


about which the true lines for the runs fluctuate, and we denote by 7 the ordi- 
nate on this line at a given r= 2p, 


7” & + BXo, 
and by & the abscissa at a given y= Yo, 
& = (yo — a)/B. 


The conditional values calculated in sec. II may now be converted to un- 
conditional values by use of the formulas 


E(f) = E(E(f| h)), 
Var (f) = E(Var (f| h)) + Var (E(f| h)), 
Cov (f, 9) = E(Cov (f, | h)) + Cov (E(f| h), E(g| h)). 
The results are 
E(4) =a, Eb) =8, EG) =, Ee) ~é, 
Var (ei) = o.’, 
where 
a.” = a,” + (8? + on’) 0,?, 
Var (6;) = o,2S;-! + os?, 
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Var (4;) = o,2(J;-! + S;-'2,;.2) + o,?, (27) 
Cov (4; 6) =— o.2x;,Si-! + oes, (28) 
Var (9;) = o.2[Ji-! + SiMao — 2;.)?] + 0? + 2o%on? + 20008, (29) 
Var (€:) ~ B02 [Ji-t + Say. — 8*] + B* lee? + Hon* + keer]. (30) 


In obtaining the approximation (30) it is assumed that o,? is small compared 
with 6?. If J;>2, 


E(é?,.) = Ge. 


Formula (26) becomes simpler, and easier to apply in problems of type (i), 
if o,?=0; similarly for formulas (29) and (30), and problems of types (ii) and 
(iii), if o.2=0,?=0. In the next section we consider tests of the two statistical 
hypotheses involved. 


IV. TESTS FOR EQUALITY OF SLOPES OF TRUE LINES, 
AND FOR IDENTITY OF TRUE LINES 


The two hypotheses considered are 
H,: o,? = 0, 
and 
H;: o,? = 0, oy”? = 0. 


Whether or not we are willing to assume H, is evidently crucial for the analysis 
of problems of type (i); similarly for H, and problems of types (ii) and (iii). It 
might seem that for problems of type (ii) the relevant hypothesis should be 
that all »;=7, i.e., that the true lines all have a point in common which happens 
to be (zo, 7). If the true lines vary this would usually be a most unlikely oc- 
currence in the kinds of situation where the author has considered applying 
this model. For problems of type (iii), H2, rather than the hypothesis that all 
§;=£, would be similarly justified. 

Tests of H, or H; will be possible only if there is more than one run with 
more than two points per run. Standard tests for H; and H; are available if we 
add the normality assumption on the errors {u,;} and {v,;}. These tests insure 
the desired significance level, but their power is unfortunately not known at 
present against the alternatives against which we are testing, the alternatives 
to H, being case (II) of sec. II with o,?+0, and to H2, case (II) with o,? and 
ri not i. zero; this is true even if we add the normality assumption on the 

(aj, b;)}. 

Suppose there are J runs under the same conditions, with J; points on the 
ith run (¢=1,2,---, J). 

To test H, we use the statistic 


F. 1 = N. 1/ é -*, 
where the numerator J is 


N= ) Si(5; a 8)2/(I ia 1), 





FITTING STRAIGHT LINES 


with 
p= Dss./ D8, 


and the denominator is 
é? = bi (J; > 2)67.,:/Ve, 


ve = DOs — 2). 


Under H; and the normality assumption on the {u,;} and {v,;}, Fi has the 
F-distribution with J—1 and », d.f. We reject H; at the level of significance p 
if F, exceeds the upper p point of the F-distribution with these d_f. 

To test H; we may imagine first fitting a single least squares line 


y=a4+ bx 
to the combined data from the I runs, so 
>» i (ij5 — 2...) Ys 
‘i 
D Lo (ty — @..)? 
.y 


4@=y.— bz, 


§ = 





2. ED 2 / Ls v2 DD w/ Le 


The test statistic is 


F, = N:3/é.’, 


where the denominator ¢,” is given by (32) and the numerator N; is 


N; = (21 a 2 > me (Ysj -a4-— bx;;)? ao rae]. 


i 


Under H; and the normality assumption on the {u,;} and {v,;}, F2 has the 
F-distribution with 2] —2 and », d.f. We reject H: at the level of significance p 
if F; exceeds the upper p point of the F-distribution with these df. 


Vv. ANALYSIS OF THE DATA 


We now consider the evaluation of the data for a set of J runs made under 
the same conditions. If there are two sets of runs, J, with one set of conditions 
(for example, with one catalyst), and J, with another, comparison between the 
two sets may be made by first estimating the quantity of interest for each set, 
namely, 8, », or £, and the variances of the two estimates by the methods below, 
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and then estimating the variance of the difference of the estimates, utilizing the 
independence of the two sets. 

A critical decision which must be made in analyzing the data is whether or 
not to assume the hypothesis H, in the case of problems of type (i), or H2 in the 
case of problems of types (ii) and (iii). This decision may be based on (a) ex- 
tensive experience with similar situations (for example, many runs with other 
catalysts with several runs per catalyst and more than two points per run), 
(b) a theory about the possible effects of uncontrolled factors (for example, 
that they may affect the location of the true line but not the slope, i.e., that 
H, is reasonable but not necessarily H:), or (c) a statistical test of the hypothe- 
sis, described in sec. IV, and based on the data being analyzed. If the decision 
is based on (c) there are the usual theoretical difficulties associated with a pre- 
liminary test of significance, aggravated by our present ignorance of the power 
of the test. 

If we assume the hypothesis H,, formula (26) becomes 


Var (5;) = o.2/S;. (33) 
If we assume Hy, (29) becomes 
Var (9:) = o.°/T:, 
where 
1/T; = Je* + Soto — %.)’, 
and (30) becomes 
Var (&) ~ B-*0,7/U/, 
where 
1/Ui = J + Sx. — * 


Let U; denote the quantity obtained from U,’ when £ is replaced by the esti- 
mate ¢ defined in the next paragraph. 

Let us denote by @, =, 7 the weighted averages of the estimates {6,}, {7}, 
{é;}, using the weights {S,}, {7,}, {U.}, respectively, thus 8 is given by (31). 
Then we find from (26), (29), (30) that if H, is true 


Var (8) = a / > S;, 


and if H; is true 


Var (4) = J XT, 


Var (t) ~ Be, / x U;. 


Estimates of these variances may be obtained by replacing ¢,* in these formulas 
by @.? defined by (32), and in the last formula replacing also 8 by 8; the result 
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of the last substitution will be another approximation (i.e., an approximately 
unbiased estimate). 

We consider next problems of type (i) in the case where the hypothesis H, 
is not assumed. We may then wish first to investigate the variation of the true 
slope b; between runs, seek its causes, and try to remove them. If at any stage 
we decide to live with some amount of variation in the true slope which it is 
not feasible to eliminate, we then proceed as follows. It is necessary in order to 
estimate the variance of the estimate of the mean true slope 8 to have more 
than one run, or else to assume Var (6;)—or possibly, os? which enters into 
Var (6,)—has the same value as in some other experiment with several runs 
where it can be estimated by the method below. 

From (31) and (26) 


a. oy? u S? 
Fe" (Ea) 


An estimate of this could be obtained by replacing ¢,? by @,? and o,? by the é»? 
given below. However, it is simpler to use for the estimate of 8 the unweighted 
mean 


Var (8) = (34) 





B’ = > 4/1, 


and for an estimate of Var (§’), 


I-X(I — 1)-1 2) (bs — BY)? 


The formula for Var (8’) is 


Var (8’) = I(T p S-—+ a). (36) 


If o.2 and os? were replaced in (36) by 6,2 and é%, we would get (35), but a 
simpler derivation is indicated below. The formulas (36) or (34) are useful in 
designing the experiment, for example to decide whether more runs should be 
taken, or more points per run. If the {S;} are equal, §’ =8; otherwise #’ is not 
only simpler to use, but is also more efficient in certain cases, namely, if the 
o»” term dominates the o,? term in (34) and (36). 

Problems of types (ii) and (iii) may be treated in a similar way if H; is not 
assumed. If we estimate » from the unweighted mean 


n = p n/T, 


Var (7’) has the estimate 


II — 1)" Zo (is — 7’)?, 
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and the value 

Var (9’) = (102 , a Ti) + a4? + 2o%on? + 2000). 
If we estimate ¢ from the unweighted mean 

? = } é./I, 
Var (¢’) has the estimate (approximate) 
Pr — 7 2 — 7, (38) 

and the value (approximate) 

Var (¢’) re e-*J-" (1-0. 2. U;" + oa" + 0,” + 2ton). 


For completeness we append the formulas for the variances of the weighted 
means 7 and ¢ defined above: 


FF (En) 
8-0? a - 
Fe" (Ee) 


The underlying parameters in our model are a, 8, 4”, os”, oa». Estimates of 
the first two are & or 4’, and 8 or f’. Estimates of the last three will be derived 
in the next section as 

6.2 = (I — 1)-'8, — ¢2I- > (J! + S—2;2), 


(aq? + Xo7on* + 2Z0ean), 





Var (7) = 


Var () ~ (aq? + §an? + 2kca). 





é,? = (I — 1)-'8, — @2I-' © S-, (39) 
Gad - (I ns 1)—'Sas + é2I-! > 2.8;—', 


where 
Sa = » (4; ae? &;)*, 
8 = > (6; — 6), (40) 


Sa = ke (4; — &)(b; — 8’). 


VI. DERIVATION OF CERTAIN ESTIMATES 
We shall use the following elementary result: 
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Lemma: If the pairs {(fi, g:)} are statistically independent for different 
i(i=1,---, J), if the {f,} have a common mean and the {g,} have a common 
mean, and if f.= >>; f:/I, g.= ><; g;/I, then the expected value of 


LX fi — FG — 9.) (41) 


(I = 1) L Cov (fs, 99)/I. 


The lemma is easily proved by writing (41) in the form 


D figs = sgh AE? p> gi’ 


before taking the expectation. 

The estimates (35), (37), and (38) may be obtained as consequences of the 
lemma for the special case where g;=f; and E(f;) =@. Then f. is an estimate of @ 
and Var (f.) =I-* >>; Var (f,;) has the estimate J-'(I—1)-! )°; (f;—f.)? by the 
lemma. 

If we apply the lemma to (40) we get 


E((I — 1)-8,) = I-3 D Var (6) =o¢2I- Zz Si + o?, 


the last equality following from (26), and hence an estimate of o? is (39). 
Similarly we may derive the above estimates of ¢,? and a». 


VII. TWO CONCLUDING REMARKS 


If for each (7, 7) the errors (u;;, v;;) have a covariance o,, which does not 
depend on (7, 7), then all the above results are valid if we replace (14) by the 
definition 


Ge,0? = 0,7 + b20,? — Wiow, 
and (25) by 
oe = oy” + (8? + oy") oy? _ 2Bour. 


It would be very desirable to have approximations for the power of the two 
tests in sec. IV against the alternatives considered there, and to have formulas 
for the variances of the estimates 6,, 6”, és, all these derived at least for the 
case where the {a;}, {b:}, {us}, {vs} are all normal. 
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LINEAR CURVE FITTING USING LEAST DEVIATIONS 


Orto J. Karst 
New York University 


A method is developed for finding a straight line of best fit to a set 
of two dimensional points such that the sum of the absolute values of 
the vertical deviations of the points from the line is a minimum. This 
is first done with the restriction that the line pass through any desig- 
nated point. In this case one application of a numerical procedure leads 
to the solution, which may be an unique line, or a sheaf of lines. When 
the restriction is removed, iteration of the procedure is necessary to 
find the line or lines of least deviations. 


1, INTRODUCTION 


T 1s well known that a straight line fitted to a set of data by the method of 

least squares is identical with the regression line whose parameters are esti- 
mated by maximum likelihood methods applied to the simple linear normal 
regression model [2]. 

However, if the question is purely one of fitting a straight line to a set of 
data with no aspect of randomness in the dependent variable, then the question 
of what is the “best” line becomes moot. A line based upon the criterion that 
the sum of the absolute values of the deviations is a minimum, may be rightly 
claimed to be a better “best” line than the more familiar least squares line, in 
the sense that it involves a simpler conceptual basis. Furthermore, statistics 
is beginning to be seriously considered as material for secondary school cur- 
ricula, and is already being offered in many college programs where the stu- 
dents lack the background of calculus necessary to appreciate adequately the 
least square theory. It would seem, therefore, that a theory of fitting a straight 
line, using no calculus for its execution or understanding, and based upon the 
simple concept of minimizing the sum of the absolute deviations might be 
worth considering. 


General Statement of the Problem 
Given a set of points {X,, Y;} i=1, 2, - - - n, find the equation of the line 








Y’=bX +a (1-1) 
such that 
S=>|¥-Y/|, (1-2) 
tml 
is @ minimum,' where 
Y/= bX; + a. (1-3) 
1 Hereafter all tions run from 1 to n, unless otherwise noted. 


118 





LINEAR CURVE FITTING USING LEAST DEVIATIONS 119 


The problem will be considered first with the restriction that the desired line 
pass through any designated point (X,, Y,), not necessarily one of the given 
set of points. The reason for this deliberate loss of generality is, that in reducing 
the number of unknown parameters in the desired line to one, the slope b, a 
direct (although possibly non-unique) solution of the restricted problem be- 
comes possible. Furthermore, the solution of the unrestricted problem requires 
iteration of the procedure developed in solving the restricted problem. 


2. THE RESTRICTED PROBLEM 
Statement of the Restricted Problem 
Given a set of points {X,, Y;} i=1, 2, - - - n, find the equation of the line 


Y’=bX +a (2-1) 
through any point (X,, Y,), (not necessarily one of the given set), such that 
S=>|¥Y;-Y/| (2-2) 
is a minimum, where 
Y,) = bX;+a4. (2-3) 
We first translate the origin to the point (X,, Y,) by the transformation 
z=X-—X, 
y= Y — ¥Y;. 24) 


The restricted problem can now be restated more simply as follows: Given a 
set of points {x., y:}, find the equation of the line 


y’ = bz (2-5) 
such that 
S=Dlu-y| (2-6) 
is a minimum, where 
yi’ = ba; (2-7) 
S may be written 
S= > | ys: — ba; |. (2-8) 


Hence, the problem devolves to finding b in (2-8) such that S will be a mini- 
mum for a known set of points {x y:}, which are related to the given set 
{X,, Y;} through the transformation (2-4). 


Minimum of S=|y:—ba;| 
Consider any one of the terms in (2-8), 
S= | ys: — bz]. (2-9) 
The graph of this equation is shown in Fig. 120A. It consists of two semi-infinite 


straight lines with a minimum at (y;/z;, 0) and slopes — | x,| and | x,| respec- 
tively. It might be noted here that the usual minimizing technique of differ- 
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4 
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yi 
Ki 





Fig. 120A. Graph of S =| y; —bz;}. 


ential calculus is inapplicable, since the derivative does not exist at the mini- 
mum. 
Graph of S= >>| ys—ba| 
The graph of S will be the sum of curves of the type shown in Fig. 120A. 
This is shown in Fig. 120B, where ABCDE is the curve of the specific equation 
S=|3-—b| +|1-—b| + |4-—-2b| 2-10) 
Ss 
A 4 E 











wt 


Fig. 120B. Graph of S=|3—b| +|1—b| +|4—26] as the sum of the individual terms. 
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based on the set of points {2;, y:}: (1, 3), (1, 1), and (2, 4). The curves of the 
individual terms are numbered in order as they appear in (2-10). It is seen that 
S is an open polygon, convex downward, where the slope of each straight line 
segment is the algebraic sum of the slopes of the curves of the individual terms 
which compose it. Thus, the slope of BC is —1+1—2=-—2. Also, it is clear 
3 
that the slope of the extreme left segment AB=— )- |x,| =—4, and the 
3 


i=1 


slope of the extreme right segment DE is >> | x;| =4, Furthermore, it is 
tel 


noted that, reading left to right, the slope of S increases by 2 | x,| above each 
minimum of the corresponding | ys—ba;| curve. Thus, at B, the slope of S in- 
creases by 2 from —4 to —2 since the slope of the constituent curve #2 changes 
from —|22| to +]|22| or from —1 to +1. 

Thus, we can see that in general the graph of S= >| yi—ba,| will be an 
open polygon, convex downward, with the following properties: 


a. The slope of the extreme left hand semi-infinite segment equals — )~ | z;| ‘ 
and the slope of the extreme right hand semi-infinite segment equals 


> |z,]. 

b. The vertices of S are located directly above the minima points (y;/z;, 0) 
of the curves of its individual constituent terms. 

c. The slope of S increases by 2| z;| at each vertex, reading from left to right. 


Location of Minimum of S 


From these properties, it follows that the minimum of S can be located by 
the following procedure. 


a. Rank the y;/z; in ascending algebraic order. (The subscript 7 now refers 
to this ranking order.) 

b. To — >>| z,| add successive values of 2|2;| until a change in sign at ¢=r 
signals the minimum point of S, since it indicates a change in slope of the 
S curve from negative to positive. 

ce. This minimum lies directly above the point (y,/z,, 0). Hence, b=y,/z; is 
the value of b for which S is a minimum. 


Solution of the Restricted Problem 
Since S is a minimum for b=y,/z,, the equation of the line of best fit is 
if (*): (2-11) 
zy 
in the transformed coordinates, or 
y-f.« ({)x on (2-12) 
Ty 


in the original coordinates. 


An Illustrative Example 


To illustrate the method we shall find the line of best fit for the set of data 
shown in columns (1) and (2) of Table 120A. We shall restrict this line to pass 
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through the centroid of these data. The choice of the centroid as the arbitrary 
designated point is motivated "by the fact that the least square line also passes 
through the centroid and hence a comparison of the two lines is possible. 

The various columns of Table 122 summarize the essential calculation neces- 
sary to find the slope of the desired line. The detailed steps of the analysis are 
given below the table. 

TABLE 122 


AN ANALYSIS OF DATA FOR LINE OF LEAST DEVIATIONS 








(2) (3) (4) (5) (6) 
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— Rank 
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Steps of Analysis of Table 122 


1. Columns (1), (2) are the recording of raw data. 
2. Columns (3), (4) are the transformation of variables X;, Y; to x,, y; using 


x= xX $= xX 
¥i= Y; = Y 
where X and FY are calculated as shown beneath the column listings. 


. Column (5) is the ratio y;/z;. These numbers, it should be recalled, are 
the b values at the minimum points of the individual curves of the terms 


| ys — ba;| on the (S, b) graph. 


. Column (6) ranks the data of column (5) in ascending algebraic order. 

. By adding the absolute values of column (3), there is obtained >>| z;| 
=71.0. This is the magnitude of the extreme left and right slopes of the 
function S= >>| y;—bz;|. 
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6. The final step is to add successively to — }-|2x,|, twice the individual 
|x,| in the sequence of their indicated ranks until a change in sign is ob- 





tained. Thus: 

— > |2,;| : —71.0 
2| 2z;| : 3.0 
—68.0 
2|22|: 13.0 
—55.0 
2| 2s | : 9.0 
—46.0 
2\a|: 5.0 
—41.0 
2| 25 | : 17.0 
~24.0 
2\a|: 17.0 

py 

17. Change in sign 
2| 27| : i0.0<! 


Since the index i=7 effects the change in sign, b = y;/z;=.659 is the slope 
of the line of best fit, 


y’ = .659z. (2-13) 
The line of best fit in terms of the original variables is 
Y’ — 30.4 = .659 (X — 62.5) 
(2-14) 
or Y’ = .659 X — 10.8. 


The graph of the scatter diagram of the raw scores and the line of best fit are 
shown in Fig. 124. 

For the purpose of visual comparison, the line of best fit based on least 
squares theory has been calculated and shown in Fig. 124 as a second line. 

Its equation is: 


Y’ = .539X — 3.3. (2-15) 
The two lines intersect at the centroidal point 
X = 62.5, Y = 30.4. 


A small refinement in the computational procedure can be introduced. In- 
stead of adding double the ranked |2;| values to — >>| 2;|, it is clear that we 
could subtract the ranked |z;| values from } >.|2z;| and observe the change 
in sign from positive to negative. Thus, the above computation would become: 
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AE |x|: 35.5 
la|: 1.5 
34.0 
la): 6.5 
27.5 
las): 4.5 
23.0 
las]: 2.5 
20.5 
las|: 8.5 
12.0 
lae|: 8.5 
3.5 
| 7] 8.5 Change in sign 
=5.0— 


The Non-unique Case 


The (S, 6) graph, whose minimum point gives the value of b of the line of 
best fit, may turn out to have a horizontal minimum line segment rather than 
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data of Table 122. 
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a unique minimum point. Thus, its general shape may be as in Fig. 125, where 
CD is the minimum horizontal segment. In this case, there is no unique value 
of 6b for the slope of the best fit line, but rather a range of values be <b<bp 











bc bp 


Fig. 125. The S, b graph in the non-unique case. 


where be and bp are the abscissas of C and D respectively. Of course, in this 
case there is no unique line of best fit, but rather a sheaf of such lines with 
slopes in the indicated range. However, all of these lines have the remarkable 
property that the sum of the absolute values of the deviations of the points 
{x:, ys} is a constant and is less than the sum for any line not included in the 
sheaf, i.e. with slope b outside the prescribed range. 


An Example of the Non-unique Case 


This is now illustrated with a specific problem. Consider the data of Table 
126, where again the point (X,, Y,) is taken to be the centroid. 
The following calculation locates the minimum. 





> ||: 36 
[m|: 18 
18— 
| ae} 18 
“Oe Change of sign 
| zs| 12 
a 
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TABLE 126 
DATA TO ILLUSTRATE THE NON-UNIQUE CASE 
































Yi 
xX Y xX vi — Rank 
z; 
12 5.27 —18 —1.81 . 1005 2 
18 5.68 —12 —1.40 . 1166 3 
24 6.25 — 6 — .83 . 1384 5 
30 7.21 0 .13 20 7 
36 8.02 6 .94 . 1567 6 
42 8.71 12 1.63 . 1358 4 
48 8.42 18 1.34 .0744 1 
¥ =30 
P= 7.08. 


Thus it is seen that the (S, b) polygon has no unique minimum point but 
rather a minimum segment with zero slope between 6=.1005 and b=.1166. 
Therefore, any member of the sheaf of lines y’= br, with .1005<b<.1166 will 
be such that the sum of the absolute value of deviations will be a minimum. 


3. THE UNRESTRICTED PROBLEM 
Introduction 
When the restriction that the line pass through a designated point (X,, Y,) 
is removed, it becomes necessary to find the two parameters of the line 
Y’=bX +a (3-1) 
under the condition that 


S= > |¥Y;-0X;-a| (3-2) 


is a minimum. 

Thus, in (S, b, a) space we have to find the values of b and a which make the 
surface S a minimum. Since the surface is a superposition of half-planes of the 
form 


S;= | ¥;-—bX;-a| (3-3) 
it is clear that it is a polyhedron. 


Geometric Properties of S 


Let the surface S=f(b, a) represented by equation (3-2) be intersected by 
any member of the family of vertical planes, a= pb, which contains the S axis, 
where p is an arbitrary parameter. Then (3-2) becomes 


S= >> | ¥; — 0(X;+ p)| (3-4) 


which is the trace of the plane a= pb on the surface (3-2). This, however, is of 
the same form as (2-8) which was a convex polygon in the (S, b) plane. There- 
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fore, since the intersection of S with any member of a family of vertical planes 
is @ convex polygon, S must be a convex polyhedron. 

S will, therefore, have a unique minimum region which may be a point, a line 
segment, or a closed polygon. The first case represents a unique solution to the 
general problem, while the other cases are, of course, non-unique solutions. 

The boundaries of the plane segments which comprise the polyhedron S lie 
directly above the set of lines 


Y¥,-—bX;-a=0, i=1,2,---,n (3-5) 


in the (b, a) plane. This is due to the fact that each of the terms of S, as in 
(3-3), is a pair of half-planes whose line of intersection is one of the set of (3-5). 
The direction numbers of the plane segments of the surface S can only change 
directly above the lines of intersection of the constituent pairs of half-planes. 

A unique minimum point of S will, therefore, lie above one of the points of 
intersection of the set of lines (3-5). 

A non-unique minimum of S, which is a line segment, will lie above a line 
segment of one of the set of lines (3-5) in the (b, a) plane bounded by its points 
of intersection with two other lines of the set. A non-unique minimum of S, 
which is a closed polygon, will lie above a corresponding closed polygon in the 
(b, a) plane bounded by members of the set (3-5). 

This geometric situation is analogous to the two dimensional case as de- 
veloped in Part 2 and illustrated by Figs. 120B and 125. 


The Local Minimum 


We have developed in Part 2 an analytic method for finding the minimum 
S for a line constrained to pass through any designated reference point. Let us 
now choose any one of the given set of points { X,, Y; I, say (X, Y:) assucha 
reference point. There is associated with this point the line 


Y, — bX¥,-—a=0 (3-6) 
of the set (3-5) in the (b, a) plane. If we consider the vertical plane 
Y, —bX¥,-—a=0 (3-7) 


in (S, b, a) space, Fig. 130, we see that it will intersect the convex polyhedron S 
in a convex polygon, whose minimum point (S;, bi, a.) (assuming a unique solu- 
tion) is associated with the analytic solution obtained as in Part 2 using Xi, Y1 
as the designated point in the following sense: 


a. The line of best fit is Y = bX + a,? 


3-8 
b. The mirimum sum of absolute deviations is S,. ats 


However, all these minimal properties are with respect to X1, Y; and, there- 
fore, are only “local” in that the minimum S; is not the absolute minimum of 
the polyhedron S, but merely the minimum of the polygonal trace on S due to 





2 Note the distinction between Y:—}bX:—a =0, which is a line or plane in (S, b, a) space, and Y =bX +au, 
which is a line in the XY plane of the given data. 
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the plane Y,—bX,—a=0. In other words, of the sheaf of lines in the (X, Y) 
plane through (X,, Y:), ¥ =b,X +a, will have the least deviations, S,, Fig. 129. 


Finding the Absolute Minimum by Iteration 


We shall first show that a local minimum line through (X;, Y:) must pass 
through at least one other point (X2, Y:) of the given data. This is easily seen 
from equation (2-12) which is the local minimum line through any designated 
point (X,, Y,). It is clear that this line also passes through (X,, Y,), another 
point of the given data. For, since the slope 

oe wetted. 3 /, (3-9) 
& we ao 


(2-12) may be written 


Y,-+Y, 
ay 2 


y’-/Y, (X — X;) 
which contains the point (X,, Y,). 
We now propose the following iterative procedure which will be shown to 
lead to the values of b and a associated with the absolute minimum of S. 
a. Choose any point (X,, Y,) of the given data* and determine its associated 
local minimum line 


 g = bX + a). (3-8) 


b. This line passes through at least one other point (X2, Y:) o° the given 
data. Using (Xz, Y2) as a reference point, find its local minimum line 


Y = bX + a. (3-10) 


It will be shown that the sum of the absolute deviations S, associated with 
(3-10) is less than S, associated with (3-8). 

. Line (3-10) passes through another point (X;, Y3) which, in turn, is used 
as a reference point to determine 


Y = bX + as, (3-11) 


with S3<S2 <i. 
. This procedure is repeated until a point (X,, Y,) is reached such that the 
local minimum line 


Y =bX +a: . (3-12) 
does not pass through a new point (Xi4:, V4) but rather reflexes back 


to the previous point (X;1, Ys). This line (3-12) is then the absolute 
minimum line of the data. 





3 Actually, this point could be any point in the (X, Y) plane. However, since the next reference point in the 
iterative process will be one of the given data, there seems to be no reason to make a special case of this step. 
Furthermore, one may be lucky and choose one of the points that contain the desired absolute mir:mum line. 
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The Iterative Procedure Leads to an Absolute Minimum S. 


It is necessary to relate the (X, Y) plane space of the given data to the 
(S, 6, a) solid space, which contains the polyhedron 


S=)> | ¥;-—bX:+a|. 


Consider Fig. 129 which shows the first point (Xi, Y:) of the iterative procedure 
and the sheaf of lines through it, one of which, with parameters (bi, a:) is the 
local minimum and passes through (X2, Y2). 


Y 





x 





Fig. 129. Sheaf of lines with local minimum through (X,, Y:). 


Next, consider Fig. 130 which shows the (S, b, a) space. The sheaf of lines 
of Fig. 129 is associated with the line 


Y, —bX¥,-a=0 


in the (b, a) plane of Fig. 130. Every line of the sheaf through (X;, Y:) cor- 
responds to a point on Y;—bX,—a=0. In particular, the local minimum 
through (X;,¥:) and (X2, Y:) corresponds to the point P;: (b:, a:). The trace 
of the vertical plane Y;—bX,—a=0 on the polyhedron S is shown as the poly- 
gon s. Directly above P; lies P;'’ on S, and the segment P:P;’ is S,, the local 
minimum sum of deviations associated with (X;, Y:). The point P,’ is, of course, 
the minimum of polygon s; but is not, in general, the absolute minimum of the 
polyhedron 8S. 

Returning to Fig. 129, the second point of the original data (X2, Y2), which 
lies on the local minimum through (Xi, Y:), also has a sheaf of lines (not 
shown) and corresponds to the line Y;—bX:—a=0 in the (b, a) plane of Fig. 
130. Since the two sheafs of (X:, Y:1) and (X:2, Y2) have a common line which 
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Y; ~bX,;-a =O 











WYe-bX2-0 =0 


Fic. 130. Traces in the S, b, a polyhedron. 


is the Jocal minimum through (X;, Y:), the corresponding lines in the (0, a) 
plane will intersect at P,. 
Corresponding to the vertical plane 


Y; — bX¥,+a=0 


is its trace on polyhedron S. This is the polygon s:, having its minimum at P,’ 
directly above P:. PP’ is S2, the local minimum associated with (X2, Y2). 

If S, is not an absolute minimum of S, then P,’ cannot be the minimum 
point of polygon s:,‘ due to the convexity of S. Hence, the local minimum S; 
associated with (X», Y2) will lie, in general, above some point (b2, a2), and will 
be less than S). 

Similarly, if S, is not the absolute minimum, the trace s; (not shown) asso- 
ciated with (X;, Y;) will have a local minimum §;, such that S;<S,<S). 
Hence, the local S; minima are a monotonic decreasing sequence. 

Only when the iteration locates the absolute minimum of S will it terminate 
by reflexing back to the previous reference point. The process cannot terminate 
at other than the absolute minimum since termination implies a minimum 
region on the polyhedron S, and due to its convexity only one such region 
exists. 

For example, in Fig. 130, if the analysis using (X2, Y:) as reference point 
reflexes back to (X;, Y;), this implies that S; is the minimum of both polygons 
8, and 8, and hence polyhedron S has its absolute minimum above P: (bi, ai). 

If the absolute minimum of S is a point, then the two final polygons s;, and 
s, will each have unique minima. If the absolute minimum is a line segment, 





‘ This assumes (X:, Y:) is the only other point of the original data on the local minimum through (X., ¥:). 
If there is another such point, then it must be investigated separately. 
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then one of these two polygons will have a line segment as its minimum, which 
will reveal itself as the non-unique solution of Part 2. Finally, if the absolute 
minimum is a plane region, then both s,;. and s, will have line segments for 
minima. 

To summarize, the iterative process starting with any arbitrary point will 
seek out the values of b and a of the desired line Y =bX+<a such that the sum 
of deviations will be an absolute minimum. 


An Illustrative Example 


In applying this method, the arbitrariness in the selection of the first refer- 
ence point does not seem to be very important. It has been the experience of 
the writer that regardless of the location of this point, even if obviously non- 
central, the iterative procedure will, in a few steps, locate the line of the mini- 
mum deviations. 


TABLE 131 
DATA OF FIG. 131 








x | 0} 20 | +afoa 2 4 
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Considering the data of Table 131 and Fig. 131, we shall deliberately choose 
as (X,, Y:) the non-central point (4, 0). The locai line of minimum deviations 
through this point is determined by the method of Part 2 to be the horizontal 
line Y=0 which passes through (—5, @) as (Xe, Y:) and has S,=17. The 
numerical calculations are omitted but they are similar to those associated with 
Table 122 of Part 2, except that (X., Y:) is used as the reference point instead 
of the centroid. 

Now, using (—5, 0) as reference point (X:2, Y:2), a second application of the 
method yields a local minimura line 


Y 


Y¥ = 167(X+5) 





x 





Fia. 131. The line of best fit for data of Table 131. 
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Y = .167(X + 5) 


which passes through (1, 1) as (X3, Y3) and has S;=15.17. 
Using (1, 1) as reference point (X;3, Y3) we obtain the same line in the form 


(Y — 1) = .167(X — 1) 


which passes through (—5, 0). The fact that this line passes through (X2, Y2) 
and does not pass through a new (X,, Y,) indicates that it is the line of absolute 
minimum deviations. Thus, the line 


Y = .167(X + 5) 


though the points (—5, 0) and (1, 1) is the line of best fit of the given data, 
with a minimum sum of deviations S = 15.17. 

It is interesting to note that even with the non-central point (5, 0) as (Xi, Y1) 
only three iterations of the method were necessary to locate the desired line. 

If the reader wishes to try the method on the data of Table 122 and Fig. 124, 
he will find that the absolute minimum line of best fit contains four points of 
the data, (50, 22), (59, 28), (62, 30), and (71, 36). In this case each of these 
points must be used as a reference point to see if its local minimum line is 
really the absolute minimum line. 

The data of Table 126 which led to a non-unique solution in the restricted 
case when the centroid was the arbitrary reference point, also leads to a non- 
unique solution in the unrestricted case. Analysis shows that a sheaf of lines 
through (30, 7.21) is a set of absolute minimal lines. The range of slopes in the 
sheaf is .1078 <<b<.1250. The line associated with the lower bound also passes 
through (12, 5.27), while the line with the upper bound as slope also passes 
through (42, 8.71). For all the lines in the sheaf, the sum of the absolute devia- 
tions is a minimum 1.65. In this case, in (S, 6, a) space the S polyhedron has 
a line segment as a minimum region, corresponding to the set of b, a parameters 
of the lines in the sheaf. 


4. CONCLUSION 


When the problem of least absolute deviations is extended to higher dimen- 
sions than the plane, the conceptual and arithmetical difficulties are greatly 
increased. Singleton [3] has developed the general n-dimensional theory, but 
it is not simple. If a mathematical regression model could be found for which 
this theory afforded a regression equation having some theoretical basis such 
as the maximum-likelihood, then a much stronger case could be made for its 
use in random sampling situations [1]. However, for the elementary case of 
fitting a straight line to a set of plane points, the theory of least deviations 
has the virtues of relatively easy calculational procedures, and a simple con- 
ceptual basis founded on minimizing the sum of the deviations. 
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A MODIFIED DOOLITTLE APPROACH FOR MULTIPLE AND 
PARTIAL CORRELATION AND REGRESSION* 


RicHarp J. Foore 
Connell Rice and Commission Company 


A method for obtaining multiple and partial correlation and regres- 
sion coefficients on desk calculators is presented that is efficient and 
highly flexible. Insofar as possible, the computations parallel those 
used when fitting interrelated systems of equations by the limited 
information approach. This method is easier for beginners to under- 
stand than the usual Doolittle approach because no back solution is 
required. Efficient methods for (1) coding the data, (2) interchanging, 
adding or eliminating variables, and (3) obtaining stardard errors of 
the function or of forecasts are included. 


INTRODUCTION 


HE author recently assisted in preparing a handbook [6] designed to provide 
‘ae complete description of the steps in statistically fitting, by the limited 
information approach, economic equations that involve two or more variables 
which are simultaneously determined by a system of equations. In connection 
with this, a method of handling ordinary least squares analyses was developed 
which utilizes the same initial steps as those for systems of equations. This per- 
mits statistical computing units to use a comparable approach for all problems 
of this sort. As many readers of this Journal have little interest in the major 
topic of the handbook and as this method of handling least squares analyses 
appears to be an improvement over certain methods commonly in use, the 
author felt that this article might serve a useful purpose.' 


OBTAINING SUMS OF SQUARES AND CROSS PRODUCTS 


The first step in the solution of any problem of this sort is to compute the 
sums of squares and cross products or moments. In the method discussed here, 
augmented moments are used and these are adjusted so that the elements on 
the main diagonal lie between 0.1 and 10. Computations involved in obtaining 
these are discussed briefly in the Appendix. 


OBTAINING MULTIPLE AND PARTIAL REGRESSION AND CORRELATION MEASURES 


The method of determining multiple regression constants discussed in the 
following pages differs in these two important ways from that given in some 
standard statistical textbooks: (1) The use of D, the inverse of the complete 
moment matrix, and (2) the computation of the inverse using a variation of 
the Doolittle method that omits the conventional back solution. Matrices in- 
volved in this and alternative methods are given in the Appendix. 





* This paper was prepared while the author was on the staff of the Agricultural Marketing Service, U. S. De- 
partment of Agriculture. 

1 The author makes no claim to originality. He has borrowed freely from standard textbooks, whose authors 
in turn borrowed from the extensive literature in this field. Certain aspects of the method are similar to that pre- 
sented in Kenney and Keeping [8] and Schults [9]. Other aspects are covered by Dwyer [4]. A brief summary of 
some alternatives in this general area are given by Dwyer [3]. 
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Forward solution.—Use first is made of the forward solution of a scheme for 
inverting matrices and solving systems of equations outlined by Doolittle [2] 
based upon methods developed earlier by Gauss [7].? As applied to a regression 
analysis, we write the adjusted augmented moment matrix in the upper left 
hand corner, as in Table 135, and write next to it a unit matrix of the same 
order. In the method described here, the dependent variable is written first, 
followed by the independent variables in numerical order. Two check sums are 
shown. The first is used in connection with the forward solution and is obtained 
initially by adding all of the elements in each row in the upper section of the 
forward solution. The missing elements must be included in obtaining this 
check sum. The reader will note that space is left for a >>, column, but use is 
not made of this in the forward solution. 

D matrir.—The D matrix, or inverse of the moment matrix, is shown in 
Table 135 immediately following the J part of the forward solution. Before 
starting its computation, a new check sum, }-;, is obtained by adding the ele- 
ments from the I part only of the forward solution for each row. In computing 
the D matrix, no use is made of the >> column as computed in the forward 
solution. 

The element in the ith row and jth column of the D matrix, d;;, is obtained 
by the following formula: 


di; = (1, 1;)(1", I) + (2’, 1)(2”, I) + (3’, 1)(38”, 1) 
+ (4’, 1,)(4”, I;) + (5’, 1)(5”, I) 


where the first term within the parentheses refers to the row and the second, 
to the column designation of the elements in the forward solution.’ Therefore: 


dy = (1)(0.4741) + (—1.0804)(—0.3932) + (—0.3593)(—0.7225) 
+ (0.2958) (0.4711) + (0.1314)(4.5847) = 1.9008 


and 
dig = (1)(0) + (— 1.0804) (0.3639) + (—0.3593) (0.1569) 


+ (0.2958)(—0.8595) + (0.1314)(—7.5270) = — 1.6934. 


These sums should be cumulated directly in the calculating machine. Ele- 
ments in the >-; column are computed in the same way as any other element 
in the D matrix. In the general formula given above, I; becomes }) 7. The 
check is performed by adding the elements in each row of the D matrix, in- 
cluding those not shown because of symmetry. In the table, this check sum 
is shown in column (6) of this section. 

The next to the last (fifth) column of the D matrix need not be computed, 
since it corresponds to the last row (5’’) in the I part of the forward solution. 





2 The abbreviated forward solution is described by Dwyer [4, pp. 51-65], Friedman and Foote [6, pp. 89-92], 
and in many statistical textbooks. In learning the method, some statistical clerks prefer to write down the steps 
instead of cumulating the products directly in the calculating machine. The detailed steps are shown in (6, pp. 7-11) 
and elsewhere. 

* Chernoff and Divinsky [1] recommend this general approach for obtaining the matrix product AB~1A’, which 
is needed in fitting equati by the limited information approach. Since IM“I’ =M~, the same method can be 
used to obtain the inverse of a matrix, provided the identity matrix is used in place of the matrix A. Dwyer [4, pp. 
192-6] describes this approach for obtaining inverses and gives a proof. Many other methods can be used to obtain 
the inverse of a matrix. This approach was used in part because it is similar to computations required in fitting 
systems of simultaneous equations. 
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All the usual measures of partial regression and correlation can be obtained 
easily from the D matrix. These calculations are shown in rows (6)—(12) of 
columns (1)—(5); column (6) is a check column. 

Partial regression coefficients —The calculation of the highest order partial 
regression coefficients, the “b’s,” is shown in row (6). This is done as follows: 

Row (6): Divide each element of the first row of the D matrix, including 
the element in the }> column, by the first element in the first row of D, and 
change the sign of the resulting quotient. Symbolically, bi;.= — di;/di:, where j 
refers to the subscript of the z’s and the column of the D matrix. The check is 
performed by adding the elements in each row in this section of the table. Since 
X, is the dependent variable, no coefficient is attached to it. The —1 in row (6), 
column (1), and the figures in the following rows are written in order to check 
the computations. 

Standard errors of the regression coefficients —The calculation of the standard 
errors of the highest order partial regression coefficients is shown in rows 
(7)-(11). This is done as follows: 

Row (7): Compute d;.d;;, that is, the product of the element in the first row 
and first column of D with the successive diagonal elements of D. The element 
in the >> column is obtained by multiplying di; by Sd;;, where Sd;; is the sum 
of the diagonal elements of D. The check is performed by addition. 

Row (8): Compute d,,;*, that is, the square of each of the elements in the 
first row of the D matrix, excluding the element in the >~ column. The element 
in the Zz column of row (8) is the sum across the row. The check on this row 
is one of recomputation. 

Row (9): Subtract each element of row (8) from the element in the corre- 
sponding column of row (7), including those in the >> column. The check is 
performed by addition. 

Row (10): Compute 1/N’d,;°, where N’ equals the number of observations 
in the sample minus the total number of variables. In this example, 
1/N’d;">=0.0138. Multiply each element in row (9) by 0.0138, including that 
in the >> column. The check is performed by addition. 

Row (11): Compute the square root of the element in the corresponding 
column of row (10), except the el.ment in the >> column. The elements in row 
(11) are the standard errors of the coefficients in the corresponding column of 
row (6). The check is one of recomputation. 

Coefficients of partial determination.—The calculation of the highest order co- 
efficients of partial determination (the square of the partial correlation co- 
efficient) is shown in row (12). This is done as follows: 

Row (12) : Divide each element in row (8) by the element in the corresponding 
column of row (7), except the =z. column. The elements in row (12) are the 
highest order coefficients of partial determination. The check is one of recom- 
putation. If the coefficients of partial correlation are desired, they can be ob- 
tained by taking the square root of the elements in row (12). 

Coefficient of multiple determination.—R?*, 245, the coefficient of multiple deter- 
mination, is obtained by the following formula: 


dymy — 1 


R, 2345 = 
dumy 
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where m,; is the adjusted augmented moment of x; on x;, which is found in the 

first row and first column of the upper part of Table 135. The coefficient of multi- 

ple correlation, if desired, can be obtained by taking the square root. 
Standard error of estimate.—s,.2s, the standard error of estimate, is obtained 


by the following formula: 
1 
8 = V ; 
1.2345 NN’du 


N, the number of observations in the sample, is required because of the use of 
augmented moments. 

Regression equation based on deadjusted data.—Since the regression coefficients 
and their standard errors are computed on the basis of adjusted data, they 
must be deadjusted in order to apply to the original data. This deadjustment, 
carried out in rows (14)—(18), is as follows: 

Column (1): Enter the variables in numerical order. 

Column (2): Enter the regression coefficients, the b’s, obtained in row (6). 
Note that no figure is entered for X,. 

Column (3): Enter the standard errors of the regression coefficients, obtained 
in row (11). 

Column (4): Enter the appropriate values of k; (see page 143 and Table 143). 

Column (5): Compute k;’ =k;/ki. 

Column (6): The deadjusted 6’s are obtained by multiplying the b’s, column 
(2), by their respective k,’. 

Column (7): The deadjusted standard errors of the b’s are obtained by multi- 
plying the s, column (3), by their respective k;,’. 

Column (8): Enter the means of the variables (see Table 143). 

The constant for the equation is obtained as follows: Multiply the dead- 
justed b’s, column (6), by the mean in the corresponding row of column (8), 
using negative multiplication, and cumulate the products directly in the 
machine. Add the mean of X, to the total. This result can be recorded directly 
as the constant in the regression equation shown in row (19). 

The final regression equation, in the following form, is shown in row (19): 


XxX, = 6 T Di2.s15X2 + bis.25X3 + Dis .235-X4 + bis.234X 5. 


The figures in the table within the parentheses are the standard errors of the 
respective regression coefficients. 

The standard error of estimate, 81.235, also must be deadjusted. This is done 
by dividing s;.2%5 by k, [row (14), column (4) ]. In our example, k;=1; there- 
fore, the standard error of estimate is the same on an adjusted or deadjusted 
basis. The indicated computation is shown at the end of row (14). 

The coefficients of multiple and partial determination need not be deadjusted. 

The check in this section is one of recomputation. 

If all of the k,’ equal one, columns (6) and (7) can be omitted. In this ease, 
column (2) is used in place of column (6) in obtaining the constant in the 
equation. 











138 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1958 


ELIMINATING OR ADDING VARIABLES 


If one or more variables are to be eliminated or added, the measures of cor- 
relation and regression can be obtained without rerunning the analysis. 

Eliminating variables—Application of the formula given below, which ap- 
plies if one variable is to be eliminated, yields eiements of a similar D matrix, 
D)x:, for all variables except the omitted one, x,. The elements of this matrix, 
the d;j)x:, can be obtained by the formula: 


didir — dadjr 
dex 





dij = 


where the d’s are the elements of D. These d;;)x, values are used in place of 
the corresponding d,; values in the computations beginning with row (6) of 
Table 135. It is not necessary to compute the entire D),, matrix. In addition to 
the first row, we need only compute the diagonal elements. 

Similarly, if both z, and 2 were to be eliminated, we would compute the 
elements of Dyi,-:, the dij xe;, a8 follows: 


izye Areync — Cirye Ajryei 


drryk¢ 





dij yke - 


Thus, if more than one variable is to be eliminated, the computations must be 
done in steps by eliminating them one at a time. 

Use of the above formulas is easy if only one variable is to be eliminated; it 
becomes more difficult as additional variables are dropped. Sometimes the 
analyst knows fairly well in advance which variable may need to be eliminated. 
If so, he should use them as the highest-numbered independent variables. If 
X, and X; were to be eliminated, this could be done by dropping columns (4), 
(5), (10), and (11) and rows (4) and (5) and their corresponding sections in the 
forward solution. The d matrix then could be easily recomputed and the re- 
maining computations carried out as in any 3-variable analysis. New check 
sums for use in the computations beginning with row (6) probably would be 
advisable. 

Adding variables.—In general, it is easier to drop variables than to add them. 
Hence, as many variables as are likely to be used should be incorporated in the 
initial analysis; some of these can then be dropped if this appears advisable. 
At times, however, a variable will need to be added. Assume that the added 
variable is X¢. Use can be made of all of the computations already made in the 
forward solution. Columns are added between the former columns (5) and (6) 
and between columns (11) and (12), and a row (6) and a corresponding section 
are added in the forward solution. Figures in these columns can be filled in by 
performing the same sort of computations as were done previously. An addi- 
tional product from the new section 6 will need to be added to each of the 
elements in the original D matrix, and a new column (6) and row (6) should 
be added. These steps can be checked by recomputation or by use of a new 
check sum. All of the coefficients should be recalculated, making use of the new 
D matrix and of new check sums. 
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USE OF AN ALTERNATIVE VARIABLE AS THE DEPENDENT ONE 


Due to the symmetry of the D matrix, an alternative variable can be used 
easily as the dependent one if this appears desirable. Steps involved should be 
clear to readers. 


STANDARD ERRORS OF THE FUNCTION AND OF FORECASTS 


Formulas for the standard error of the function at a particular point or of a 
specific forecast are given in Ezekiel [5, pp. 342-5] and elsewhere. In the 
method of regression analysis used by Ezekiel, the inverse of the moment 
matrix that involves only the independent variables is used. He refers to the 
elements of this inverse as the c,;. The following formula may be used to con- 
vert from the elements of the D matrix, based on adjusted augmented moments 
for all the variables, to the elements of the C matrix as used by Ezekiel: 


dydi; — dah; 


Cy= N kk i. 
j dis ,] 





The deadjusted value of the standard error of estimate is used in the formula 
given by Ezekiel. 


EXAMPLES OF THE FLEXIBILITY OF THIS APPROACH 


Two examples from our work indicate the great flexibility of this method. 

First example.—Here the independent variables for a series of analyses were 
the same but the dependent variable changed from analysis to analysis. In 
this case, the independent variables were written first in the forward solution, 
followed by the first dependent variable. The usual computations were per- 
formed, except that the D matrix first was computed only for the independent 
variables. This sub-matrix was recorded on a second worksheet. The complete 
D matrix was obtained by adding a single product to each element of the sub- 
D matrix, and copying the last row of the J part of the forward solution as the 
last regular column of the D matrix. In obtaining the coefficients, the last 
column of the D matrix was used in the same way as the first row normally is 
used, remembering that d,; is the last item in the last regular column of the 
matrix. By thinking through the nature of the modified matrix, the reader can 
easily see the exact steps involved. 

Substitution of the next dependent variable involved (1) placing strips of 
paper over the original xz, and >> columns and over the z; section of the for- 
ward solution and carrying out the necessary computations on these strips, 
(2) adding a new product to each element of the sub-D matrix, (3) copying the 
last row of the J part of the new section as the last regular column of the D 
matrix, and (4) computing the coefficients from the new matrix. Thus, for each 
problem, the only additional work inyolved was that growing directly out of 
the use of the new variable. 

Second example.—This involved six correlation analyses for which we initially 
were interested only in the coefficient of multiple determination, but for which 
we wished to compute all of the coefficients for two analyses that gave the 
highest R?. One set of three analyses involved a price variable as dependent, 
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and current consumption, lagged consumption, and consumer income as inde- 
pendent variables. We wished to try out three sets of weights on lagged con- 
sumption and to use that analysis that gave the highest R*. A similar set of 
three analyses were to be run with current consumption dependent and price 
included among the independent variables. 

The'‘forward Doolittle solution first was run in the usual way, using price 
as the dependent variable, and one of the series on lagged consumption as the 
last of the independent variables. However, only the J, and J, columns in the 
forward solution were computed for that variable.‘ The first element only of 
the D matrix was obtained, and from this the multiple coefficient of determina- 
tion was calculated. The next lagged consumption series was substituted by 
using strips of papers, as in the preceding example, and a second multiple co- 
efficient of determination was obtained. The reader will see that this process 
easily could be continued for as many sets of weights as desired. Having found 
that lagged consumption which yielded the largest R?, we carried out the re- 
maining computations using that variable. 

We might have known in advance that the analysis that yielded the highest 
correlation with price dependent also would yield the highest correlation with 
consumption dependent. However, we did not make use of this knowledge but 
instead obtained R?, 134 by computing dz. from data in the forward solutions 
referred to previously. The analysis yielding the highest R? turned out to in- 
volve the same variables as that for which complete computations were made 
in the first set. We filled in the missing element in the second row of the cor- 
responding D matrix, and used this row in the same way as the first row nor- 
mally is used in obtaining the coefficients, remembering that dz. appeared as 
the second rather than the first element. 

Thus, for this exampie, partial computations were carried out for six 4- 
variable analyses and complete computations for two of them with only about 
twice as much work as for a single analysis. 
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APPENDIX 


Alternative ways of obtaining the highest order partial regression coefficients.— 
In the method discussed in this article, the partial regression coefficients are 
obtained as ratios of certain elements of the inverse of the complete moment 
matrix. Identical answers are obtained by a simultaneous solution of the normal 
equations or by multiplying the inverse of the moment matrix for the inde- 
pendent variables only by a vector of the moments on the dependent variable. 
As the partial and multiple coefficients of determination can be computed as 
functions of the moments and the respective partial regression coefficients, 
identical answers for these values also are given by the several methods. We 
show here that identical answers are given for the regression coefficients for a 
3-variable analysis. Although the algebra becomes more complex, a similar 
demonstration could be given for analyses that involve additional variables. 

We write the complete moment matrix in the following form: 


mi M2 Ms 
M=|{ma M2 M2 
M3 M32 M33 
1. By the method described in this article, 
mi! mis * 


and bis2 = — nt 
my mu 








bie.s = & 


where m,;~' is the ijth element of the inverse of M. Now 


M2M33 — M23” 

















my! 
| M| 
™42™M33 — My3Mzs2 
m:! = 
| M| 
MsMo3 — M3M22 
m3; ) = 
13 
| M| 
Hence, 
M4233 — M3732 M3M22 — M4223 
bis.s “#7 : ’ bis.2 _ : ; 
Mo2Ms33 — Moz M22M33 — Me3 


2. The following matrices are involved in the method that makes use of the 
inverse of the moment matrix for the independent variables: 


bu eh | ies kal 

M32 Mea m3 bis.ot . 

The reader can easily verify that identical answers are obtained for the co- 
efficients. 
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3. Simultaneous solution of the following normal equations also yields esti- 
mates of the regression coefficients: 


Mabr2.3 + Masbis.2 = Mis 
mMesrdio.s + Masbis.2 = Ms. 


A matrix solution of these equations is identical to that in (2) above. 

Advantages of the use of the inverse of the complete moment matrix are not 
fully illustrated by the above demonstration. If only the regression coefficients 
and the multiple coefficient of determination are desired, these can be obtained 
at least as efficiently by either of the last two methods. However, if the partial 
coefficients of determination are desired, a different moment matrix must be 
inverted for each such partial unless the complete moment matrix is used. The 
chief advantage of using the complete moment matrix is that all relevant co- 
efficients can be computed from the elements of its inverse. 

Computations in obtaining adjusted augmented moments.—We assume that the 
reader is acquainted with the general procedure in obtaining moments. We dis- 
cuss here chiefly aspects that relate to their augmentation and adjustment. 

Augmented moments’ are used to avoid rounding errors. The sums of squares 
and cross products, in terms of original values, are cumulated directly on the 
calculating machine, as for any problem of this type. The total is then multi- 
plied by the number of observations and, by use of negative multiplication, 
the correction factor is subtracted from it. The correction factor for an aug- 
mented sum of squares equals the square of the sum of the series; for an aug- 
mented cross product, it equals the product of the sum for each series. The 
result, which represents the augmented moment, is written down directly as 
in Table 143. It should be noted that X,, the dependent variable, is written first. 

A check column should be carried in these and other computations. This 
column is handled in the same way as in any regression analysis; hence its use 
is not discussed here. 

If an error is made in computing the sums of squares and cross products, the 
following method is more efficient than a direct recomputation as a means of 
locating the error. Suppose that the checking operation indicates that an error 
has been made in obtaining the moments with X,;. Continue to calculate the 
moments with X, and, if the check for this indicates that no error has been 
made, we know that the augmented moment between X, and X, in the first 
section is correct. Similarly, if the moments with X; check, we know that the 
augmented moment between X, and X; in the first section is correct. If all other 
moments check, the mistake is in the computation of the sum of the squares 
for X;. If one of the moments does not check, recomputation of the corre- 
sponding element in the first section is indicated. A similar procedure is used 
if the initial error occurs in a moment other than with X,. 

It is a great convenience in computations to have all the elements on the 
main diagonal of the moment matrix close to 1.6 A set of values that are powers 
of 10, the k;, where 7 is the variable to which it applies, is chosen such that when 





5 An augmented moment equals the actual moment multiplied by the number of observations in the sample. 

* In some computational schemes, this is accomplished by working with the matrix of simple correlation co- 
efficients instead of the moment matrix. Moment matrices were used in the method presented here primarily because 
they are required for computations in connection with the fitting of simultaneous systems of equations. 
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the corrected sum of squares for the variable is multiplied by the square of the 
k; the answer lies between 0.1 and 10. The value (k;)? is referred to as the adjust- 
ment factor. [he k; areshown directly beneath the respective means in Table 143. 
They are determined in the following manner: The corrected sums of squares 
for X,, X2, and X;, respectively, in Table 143 lie between 0.1 and 10; therefore 
the adjustment factor equals (1.0)? or 1.0 and k equals 1.0 for X,, X2, and X;. 
For X,, however, the sum of squares equals 161.1827 and it must be multiplied 
by an adjustment factor of (0.1)? or 0.01 to bring it between 0.1 and 10; there- 
fore k, equals 0.1. If the sum of squares for X,, for example, had been 1611.827 
the adjustment factor would be (0.01)? or 0.0001 and k, would equal 0.01. The 
adjustment factors for the corrected cross products, the k,k;, are obtained by 
multiplying the k’s for the variables involved. The k,k; are shown directly 
beneath the respective moments in Table 143. 


TABLE 143 


MEANS, AUGMENTED MOMENTS, AND RELATED 
ADJUSTMENT FACTORS! 























Item Xi xX: X; X, Xs z 
Sum 35.1628 | 48.5788 | 0.1299 | 54.7388 | 30.9489 | 169.5594 
Mean? 1.4065 | 1.9431 .0051 | 2.1895 | 1.2379 6.7823 
ky 1 1 1 Py 1 —_— 
Moments with— 
X; 
Moment 2.1088 2.2784 .5799 6.4306 1.4190 12.8169 
Adjustment factor | 1 1 1 eo » -—- 
X: 
Moment 5.2090 .4121 | 21.6403 3.0714 32.6114 
Adjustment factor 1 1 | 3} y — 
X; 
Moment -6735 .9408 1.0659 3.6725 
Adjustment factor 1 Pe > | — 
Xs 
Moment 161.1827 |—4.0788 | 186.1158 
Adjustment factor 01 01 — 
Xs 
Moment 10.6365 12.1141 
Adjustment factor 01 _ 











1 The computations were performed with 6 decimal places, of which only 4 appear in the table; therefore, some 
of the checks may appear slightly in error. 

2 For this example, N =25. 

Adjusted augmented moments are obtained by multiplying the augmented 
moment, the terms in the first row of each section of Table 143, by its appropriate 
adjustment factor. They are recorded directly in the table used in connection 
with the Doolittle solution (Table 135). 

The adjustment process is important, naturally, only when the k, differ con- 
siderably from one. Particularly when working with logarithms or first dif- 
ferences of logarithms, all of the k; normally are close to one. Some computing 
units may prefer to adopt a general rule that adjustments are made only when 
at least one of the k; lies outside the range 0.1 to 10. Had this rule been fol- 
lowed, adjustments would not have been made for this analysis. 





A PROCEDURE FOR COMPUTING REGRESSION COEFFICIENTS 


Dup.ey J. CowpDEN 
The University of North Carolina 
Most statisticians are familiar with the method whereby partial 
correlation coefficients are derived from coefficients of lower order. 
But analogous methods for regression coefficients are apparently not 


widely known, and seem not to be described in the text books. Such 
methods are discussed here. 


PROCEDURE 


EGRESSION coefficients of zero-order, first-order, and second-order may be 
computed as follows.! 


a > 2a; aia 


bs; = ———— _ where z= X — X. 
ai is 


| 1 de 
bi; ~ bebe; | bie by; 
bist a = 
1 — Ddjibi; 1 by; 
he 3 


1 Deyn 





Dij.eu me Biju — Dit. wDej.u - | Dieu Diu 


1— Dye uDdej | 1 Biju 
Dit.u 1 





Note that the formulas for b,;., are exactly like those for b;;.., except for the 
addition of u after the decimal point in the subscripts of the latter. In an 
analogous manner, formulas for third and higher order coefficients can easily 
be written. 

In the following derivation of formulas (1), subscripts 1, 2, 3, and 4 are used 
instead of 7, 7, t, and u, since the former notation seems a little simpler. 

For the regression equation 


L123) = Di2.s%e + dis.2vs, 
the normal equations are: 
(i) Dies ra 22 + dbis.2 b 9 Xot3 = > X22; 
(ii) bie. Dy tors + dis.2 Do ts? = Do rite. 
Dividing both sides of (i) by }>z,?, and both sides of (ii) by >°z;", we obtain: 
(i) Ddio.s + bis.2bs2 = bis; 


(ii) Dye sbes 4- bis.2 = dis. 


1 With respect to the regression coefficient being computed: i refers to the dependent variable; j refers to the 
“active” independent variable; ¢, u, v, - + + refer to the “passive” independent variables, i.e., those “held constant.” 
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COMPUTING REGRESSION COEFFICIENTS 
Solving these simultaneous equations, we obtain: 
bis es Disbse . 
1 — desbs: 
bis —_ Diodes 

1 — bubs 


bis.s _ 


bis.2 7? 


For the regression equation 
71023) = Die.sa%2 + Dis.247%3 + Dis 2304, 


the normal equations are: 
(i) Diese > Xo? + Dis.24 pa Xot3 + Dig.o3 bo int, = y L122; 
(ii) Dis.s4 po Lots + disc Zz x3” + discs z U3, = , 4123; 
(iii) Die.s y Lot4 + Disc > 3X4 + dig.23 , i zz? = 7 2X4. 
Dividing both sides of (i) by >z#, both sides of (ii) by > 3, and both sides 
of (iii) by }o22, we obtain: 
(i) Die.ss + Dis.sabse + Disosda2 = D1; 
(ii) Dis.sades + Dis.c4 + Dis.osbss = dis; 
(iii) bis. sada + bis.cadss + bis.2s og Dis. 
Solving together (i) and (ii), we obtain: 
(IT) bis.a(1 — bs2bes) + Dis .23( Das = bs2bes) = bis _ Diodes; 
(III) bis.c4(Os4 — Ds2bo4) + Bis.2(1 — Daobrs) = bis — diodas. 
Dividing (II) by (1—Ds2be3), and (III) by (1—b42ba), we obtain: 
(II) bis.c¢ + Dis.c3ba3.2 = 13.2; 
(IIT) bis.c4s4.2 + Discs = disr. 


Solving these equations simultaneously, we obtain 


bis.2 a bis.2bs4.2 


bus: = 





i- bas.2bs4.2 


In a similar manner, formulas for bj3.42 and by2.43 may be derived. 

If h is the order of the regression coefficient being computed (the number of 
variables held constant) and g is the number of independent variables in the 
final regression equation, the number of regression coefficients of each order 
that must be computed, if we use formulas (1), is 


(h + 1)(g — h)?. (2) 
Thus if we have three variables in our final regression equation, two of these 
variables being independent, we must compute: 
(0 + 1)(2 — 0)? = 4 zero-order coefficients; 
(1 + 1)(2 — 1)? = 2 first-order coefficients. 
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If we have four variables, three of these being independent, we must compute: 
(0 + 1)(3 — 0)? = 9 zero-order coefficients; 

(1 + 1)(3 — 1)? = 8 first-order coefficients; 

(2 + 1)(3 — 2)? = 3 second-order coefficients. 
For a 5-variable equation the minimum number of coefficients that must be 
computed is: 16 zero-order; 18 first-order; 12 second-order; 4 third-order. Quite 
likely the abbreviated Doolittle method is then more efficient. 
If we want an internal check on our computations, a somewhat larger number 
of coefficients must be computed. On the other hand, a slight reduction in the 


amount of labor required is obtained by use, whenever possible, of these al- 
ternative formulas. 


Lie; | 
bie = by — dada; = | - ; 


by bis 


it.ju 


| 1 b 
bij. Diju 


Dist _ Dijn — Des uDit ju 





Again notice that the two formulas are exactly alike, except for the addition 
of u after the decimal point in the subscript of the b,;... formula. The abbrevi- 
ated Doolittle solution of normal equations uses the computations indicated? 
by equations (3). 

In order to minimize computations, a consistent policy must be followed 
with respect to variables held constant. In the illustrations that follow, we 
always compute the b;;.. before we compute the b;; 3, and we compute the b,; 4 
only if necessary. Also, we would compute the bj;.s2, the b;;.42, and the bj; 43 for 
a 5-variable problem and avoid holding variable 5 constant. Finally, in the 
subscript following the decimal point t>wu. For a 5-variable problem t>u>v. 


THREE-VARIABLE ILLUSTRATION 


We have these measures of variation and covariation. 
> xn? = 28 > X22 = 20 >> zzz = 24 
> x? = 26 D> mts = 15 
> 23° = 28 
We compute the 4 zero-order coefficients required by formula (2). 
bie = 0.76923 bis = 0.85714 
be = 1.0 bes = 0.53571 
bs. = 0.57692 bss = 1.0 





* See Dudley J. Cowden, “Correlation Concepts and the Doolittle Method,” Journal of the American Statistical 
Association, 38 (1943), 327-34. In this paper, the symbols were carefully chosen to facilitate understanding of re- 
gression and correlation concepts. 





COMPUTING REGRESSION COEFFICIENTS 
Using formula (1) we compute the first-order coefficients. 
bis — biebes §=—- .85 714 — (.76923)(.53571) 
1— bubs 1 — (.57692)(.53571) 
bis — dishes = .76923 — (.85714)(.57692 
i-th" Te Ikea = 30762 


bse = = .64414. 














bis.s ~ 


Note that the two denominators are the same. 
We can compute the multiple correlation coefficient and the two partial cor- 
relation coefficients by conventional procedures. If we wish to compute 


ris2 = Vobisedbs.2 and firs = Vbis.sba.s, 
we need two more regression coefficients, 


pp Oban _ bn — baba 
31.2 1 i, hale 21.3 a bisba: 


which requires also that bs: and b;,; be computed. For the multiple correlation 
coefficient one can conveniently use any cf the following procedures: 


P > 2123)? a bis.s > rt + bis. > zs : 
71 (23) — >: ay? _— > a? , 
T1(23)" = (1 — ri2*)(1 — ris.2”). 


L— Tics)? wre (1 pn ris*)(1 ~ Ti2.3"). 





The procedures outlined in this paper have no special advantages with re- 
spect to computations of standard errors of regression coefficients. How- 
ever, such standard errors are easy to compute. First we compute the unex- 
plained variation: 


Z. nu. = 7 2,3 — >21 2)? = 7 xi? — bie Zz U22; 
Zz ns’ = 2. x? — > Xs)" = : rt? — bis Yo T23; 
} L2.3° = +b 2° — z. 223)? = , om x2” — bes z. T2X3; 
> X32 = > z;* — ya 32)" sa a > rs" — bse To. 


Then the estimated variances of the regression coefficients are obtained in the 
usual manner: 


81.2" a, 21.2” 


875.5 = = 


> x2? n—-2) > 2° 








81.3" ¥ + Xi.3" 
Lat (n — 2) Da? 
, 81.23” Zz 21.23" 
8 b12.3 = — ; 
2, 2.3" (n — 3) } 22.3" 
81.23” pn Z1.23" 


2 ob pwr ‘ 
othe _ 23.2" (n — 8) y 3.2" 








rr 
sb: = 
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It is easier, however, to test the significance of the correlation coefficients, 
the tests for the partial correlation coefficients being mathematically equivalent 
to the tests for the corresponding regression coefficients. 


J/n-—-83 
ties = Tizs V . 2 a ; 
Fass 


4/ n—3 
2 = 13.2 ness 
13.2 1 eT 


(n = 3)ri cs)? 
2[1 — ri 23)? ] 


1(23) = 


To test F423), one enters the F table using degrees of freedom 
¥g=n— 3. 
FOUR-VARIABLE ILLUSTRATION 
Consider the following measures of variation and covariation. 
ps 2? = 28 7 Mr. = 20 _ M123 = 24 bar M1% = 18 
> x2? = 26 > Ler3 = 15 F Tok, = 19 
> 2s? = 28 >> an = 15 
7 zr? = 36 
Except for the addition of variable 4, the data are the same as for the 3-variable 
illustration. For the sake of clarity of exposition, no attempt will be made to 
avoid repetition. 
The nine zero-order regression coefficients that must be computed, according 
to formula (2) are 











bis = 0.76923 
bes = 1 .0 

ba = .57692 
ba =. 73077 








bis =0.85714 
bes= .53571 
by =1 .0 

bas ™ .53571 





bis =0.50000 
bas = .52778 
bu= .41667 
bu = 1 .0 





Formula (2) indicates that 8 first-order coefficients must be computed. 


VARIABLE 2 HELD CONSTANT 


VARIABLE 3 HELD CONSTANT 
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Using formula (1) we obtain these results with variable 2 held constant. 


bis — disbes 85714 — (.76923)(.53571) 

bis.2 = = = 64414. 
1 — bebe 1 — (.57692)(.53571) 

bis — bisbx, 50000 — (.76923)(.52778) 

1 ~ fede 1 — (.73077)(.52778) 

bu — bssbu  .41667 — (.57692)(.52778) 

. = hie 1 — (.73077)(.52778) 

bes — babes .53571 — (.73077)(.53571) 

1 — bebe 1 — (.57692)(.53571) 











Dis2 = 








bsa.2 = 








bas.2 - 


Note that although there are four different numerators, there are only two 
different denominators. 


Using formula (1), and also formula (3) where possible, we obtain these re- 
sults with variable 3 held constant. 
= a bis — disbse "a .76923 — (.85714)(.57692) 
1 — bosbse 1 — (.53571)(.57692) 
v0 = Dio — dgelis.2 = .76923 — (.57692)(.64415) = .39761. 
bis — disbss ad .50000 — (.85714)(.41667) . 
1 — Ddasbas 1 — (.53571)(.41667) 
bes — besbss  .52778 — (.53571)(.41667) 
1 — babs 1 — (.53571)(.41667) 
73077 — (.53571)(.57692) 
1 — (.53571)(.57692) 


= .39762. 














bias = 


39209. 








bos = 


= .61034. 








i ¥ be at basbso rae 
oS OL = beabas 
bee = Dae os bsab43.2 = .73077 _ (.57692) (.20874) = .61034. 


Note that in the use of formula (1) four new numerators are required, but no 


new denominators. 
The four second-order coefScients are computed below. 


Dis.se = 


(bo. _ 


lbus.e a 


( 
- 
bis.as *” 


bias = bis.2bs4.2 


.15305 — (.64414)(.18261) 





i= bas.2ba4.2 


bis.2 on bis.obas.2 ait 


1 — (.20874)(.18261) 
.64414 — (.15305) (.20874) 





1 — bss.2bas.2 


bis.2 4 bas.adia.se = 


bie.s ial bis.sdae.s - 


1 — (.18261)(.20874) 
.64414 — (.20874) (.03682) 


.39762 — (.18391)(.61034) 2 





i= bos sba2.3 


bis.s = ba. sdia.se = 


1 — (.39209) (.61034) 
39762 — (.61034)(.03682) = 


.03682. 


= .63645. 


.63645. 


7515. 


37515, 
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It is perhaps worth noting that, having computed 6:34: and bi243, we can if we 
wish compute: 

Dis.se = Dya.2 — D34.2bi3.423 

Dia.2s = bus D bas.sDi2.cs- 


It is of course true that by4.32= 14.2. 
In order to obtain an internal check we may compute: 


b, =. Diss eat) bie sbea.s 2 
~ . i lies 


b; pe: bis.4 —_ Dis.abos.4 ; 
meet oo Oh, 








bios iis bis.abs2.4 





Die. = 
i. 


To obtain these second-order coefficients we must compute also the following 
first-order coefficients. 
VARIABLE 4 HELD CONSTANT 























A STATISTICAL MODEL FOR LIFE-LENGTH OF MATERIALS 


Z. W. BrrnpauM 
University of Washington 
AND 
8. C. SauNDERS 
Boeing Airplane Company 


A statistical model for life-lengths of structures under dynamic load- 
ing is derived. The model makes it possible to express the probability 
distribution of life-lengths in terms of the load given as a function of 
time and of deterioration occurring in time independently of loading. 
The special case of a constant load (or of periodic loading with constant 
amplitude) leads to gamma distributions for life lengths. 


1. INTRODUCTION 


T 1s well known that a structure may be properly designed for a certain static 

load, and yet will fail after some time when exposed to a load varying in 
time, although this variable load may never exceed or even approach the static 
load for which the structure was designed. This failure of materials in struc- 
tures, computed and designed correctly from the static point of view but placed 
in normal use under conditions involving dynamic (variable) loading, is often 
referred to as “fatigue.” 

If a number of carefully specified structures, made as uniformly as possible 
are placed in normal use which involves qualitatively and quantitatively the 
same kind of dynamic loading, they most likely will fail after times which will 
differ from one individual structure to another. The life-length 7 of a structure 
under such normal use is, therefore, not a fixed value but a random variable. 

It is of considerable practical importance to know the probability distribu- 
tion of the life-length T for a structure under conditions of its normal use. For 
example knowing a value 7’ 99: such that the probability of failure before T’ oo: 
is .001 would enable us to replace the structure before this time 7’ .oo0: and thus - 
assure that in the long run no more than one structure in a thousand will fail 
in use. 

Several families of probability distributions have been considered for de- 
scribing the behavior of life-lengths of materials under normal use. The family 
of exponential distributions is the best known and most thoroughly explored, 
due to the work by B. Epstein and his associates (see [3], [4], [5] and further 
references given in these papers). Normal distributions have been used exten- 
sively, sometimes after a logarithmic transformation was applied to the original 
data; for discussion and further bibliography the reader is referred to [2]. In 
addition, a distribution commonly ascribed to W. Weibull has received con- 
siderable attention (see [6] and references given there). 

The exponential distribution has a number of desirable mathematical proper- 
ties, but its usefulness is sharply limited due to the following property: it can 
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be proven that if the life-length 7 of a structure has the exponential distribu- 
tion, then previous use does not affect its future life-length. In other words, if 
a structure has not failed up to a time ¢, then the probability distribution of its 
future life-length 7 —t is the same as if the structure was quite new and had 
been just placed in use at time ¢. There are structures which have this property, 
for example, a jeweled bearing in a watch in the first years of use which, as long 
as it has not failed, is as good as it was in the beginning; or an electric fuse 
(assuming it cannot melt partially) whose future life distribution is practically 
unchanged as long as it has not failed. It is clear, however, that this property 
of the exponential distribution makes it inadequate for describing the life- 
distribution of any structure which, when in normal use, undergoes changes 
affecting its future life-length. 

The normal distribution is an excellent analytic tool, but many life-length 
distributions occurring in practical applications are obviously not normal, for 
example because they are markedly skewed while the normal distribution is 
symmetrical. 

The mathematical model derived in the following comprises exponential, 
norma! and Weibull distributions as special cases and, in addition, leads to a 
wide family of skewed distributions. 

The assumptions of this model are not claimed to have a rigorous basis in 
the available knowledge of the physics of materials. They appear, however, 
plausible, and account for such factors as deterioration, load functions which 
vary periodically or aperiodically, and (see Section 6) can be extended to the 
case of random loading. The main advantages of the resulting model seem to 
be that a) the concepts involved lend themselves to intuitive interpretation, 
b) the resulting probability distributions are manageable, their application re- 
quires only tables which are easily available, and procedures for estimating 
parameters are available and simple, and c) in the three series of carefully con- 
trolled experiments described in Section 6 it yielded very good agreement with 
empirical data. 


2. SINGLE-COMPONENT STRUCTURES 


We consider an elementary structural component, which sustains damage 
(or wear) in time described by a nonnegative function 5, such that 6(¢) is the 
instantaneous damage to the structural component at time t>0. If the compo- 
nent fails at time t, we call ¢ the life-length under damage function 6. It seems 
reasonable to assume that the probability distribution of the life-length will 
depend on the damage function 5. We indicate this dependence by writing the 
cumulative distribution function (c.d.f.) and the probability density of the 
life-length as follows 


Pr {T <t| 5} = F(t; 6) forallt>0 (2.1) 


d 
a Pt 5) = f(t; 4) for allt > 0. (2.2) 


We make the convention that random variables will be denoted by upper case 
letters and their observed values by the corresponding lower case letters. We 
now define the failure rate of our component at time ¢ as the functional value 
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f(t; 6) 


ya(t) = 1- Fan Fit. 8) , 


(2.3) 


The intuitive meaning of y;(t) becomes clear if one observes that 1— F(t; 5) is 
the probability that a component does not fail up to and including t; hence 
ys(t)dt is the probability that a component which has not failed until ¢ will fail 
between t and t+dt. From (2.3) it follows by integration that 


F(t;6) =1—- exp }— f valadash (2.4) 
0 


We now make our first 


Assumption: A 


The failure rate at time t, y;(t), depends only on the elapsed time ¢ and 
the instantaneous damage at time ¢ in the following way: 


ya(t) = w(t) d(t). (2.5) 


Equation (2.5) states that the failure rate at time ¢ depends on the damage 
at that time, but does not depend on values of 6 at either earlier or later 
instants of time. It aiso states that the failure rate at time ¢ is directly propor- 
tional to the instantaneous damage to the component at that time. 

The derivative w’ of w will be called deterioration of the component. If w’(t) =0 
and 4(t)=c, i.e., there is no deterioration and the component is exposed to a 
constant instantaneous damage, then (2.4) becomes, upon setting w(t) =k for 
some k>0, 


F(t; 6) = 1 — e-*¢ 
so that the life-length has exponential distribution with expectation 1/ke. 


If w and 6 are such that y(t) =kat* for some k>0 and some a>1, then 
(2.4) becomes 


F(t; 6) = 1 — exp (—kt*) 


i.e., the Weibull distribution. 
In the general case (2.4) shows that the random variabie 


T 
> on -f v3(s)ds (2.6) 
0 


has the exponential distribution with expectation 1. Hence transformation (2.6) 
of the time-axis always transforms the life-length T into T* with exponential 
distribution. 


3. MULTIPLE-COMPONENT STRUCTURES 


We now consider a structure consisting of m>1 components of the type dis- 
cussed in the preceding section, which is exposed to total instantaneous damage 
5. In the course of time one component after another fails. There is a critical 
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number of failures k<m, such that the entire structure fails when k of its com- 
ponents have failed. This number & will, in general, depend on the structure 
and may be as low as k= 1 (e.g. in the case of a chain where the failure of one 
link causes the structure to fail). 


Assumption: B 


If at time ¢ the structure is sustaining damage 4(t) and if 7 of its components 
have failed at that time, then each of the remaining m—j components is 
at time ¢ exposed to the damage 4(t)/(m—y). 


According to this assumption, irreversible and cumulative damage occurs 
in the structure in the course of time since, as the single components fail one 
by one, each of the surviving components is exposed to an increasing share of 
the total damage 5. The deterioration of the single component, however, will 
remain unchanged, regardless of the number of components which have failed. 

We need the following well-known 


Lemma: For a population with density f, and c.d.f. F, the marginal density of 
the smallest observation y; from a sample of size m is given by 


m[1 — F(m)]""*f(y). (3.1) 
For a proof see reference [7], p. 91. 


Theorem: Under assumptions A and B, if S, is the life-length of a structure 
until k components have failed, then the random variable 


8; 
U= 2f va(t)dt 
0 

has the density 

1 

h(u; k) = ———— y*'!e-“/2 for u > 0, 
(k — 1)!2* 

that is, the x? distribution with 2k degrees of freedom. 


Proof: . 


We proceed by induction. If k=1, the proof is immediate from (2.4). Assume 
then the theorem is true for k=j. Suppose the times until failure of the 7 com- 
ponents which have failed first are s;, - - - , s;. We define the damage function 
5* by 
5(t) 

m 

5(t) 
*(t) ={m—1 


0O<t<s 


& <i < 8 


5(t) 
m—j 
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Let us find the conditional density of the time until failure of a single com- 
ponent under the damage function 6*, when it is known that it must fail after 
8;; this can be done according to (2.4) as follows: 


F(r; 8*) — F(s;; 6*) 
1 — F(s;; 6*) 


exp {— f“swaroar — exp {— [“ewaroar} 
exp {- f“swsrwa 


= 1 — exp \- J ews oae for r > 8;. (3.4) 


i 





Pr {fail in (s;, 7) | fail in (s;, @)} = 





But from (3.3) we have 6*(t) =6(t)/(m—Jj) for t>s; and since the time until 
failure of the (j+1\st component is the smallest observation from a sample of 
size (m—j) each of which has c.d.f. given in (3.4), we obtain upon utilizing 
(3.1) with m replaced by (m—j) the conditional density of time until failure of 
the (j+1)st component 


sj-1 
flsi| 8) = nals) exp {=f na(oath for sis > 


2 


Now, by the identity f(s;.1) = fo’#1f(s;+1) | 8;)f(s;)ds;, and by the induction as- 
sumption and (3.2) we have 


fs) = | f° ntoa] ew {- f"nioabwisy 6.9 


and integration yields the result. 


4. APPLICATIONS OF GENERAL THEORY 
4.1. Case of no deterioration. 


Let us consider the case when single components do not deteriorate, i.e., 
w(t) =a>0, say, and that 4(t) depends only on the load I(t) at time ¢t through 
a function A, i.e., 5= h(l). From (3.5) we have as the density of S, the time until 
failure (we now have deleted the subscript k and regard it as a parameter to 
be determined from the data), 


1 s k-1 ” 
= — | qa! A(l(t))d —atl h(l(t))d —Ip(l . (4.1.1 
f(s) | foraw) 4 exp { f (un) ta (U(s)). (4.1.1) 


I'(k) 


0 
Let v= /*h(I(t))dt; we have as the density of v 
1 yt 


ep ae ees 9” O/E 
g(v) Tae € v>0. 
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Suppose we have a sample », - - - , v, from this population, hence we have 
as the joint density 
l 


. 1 
ees ces o (p. ) O- lala 
I] T(k) a anil 


and as the likelihood 


L = >} (-In (kh) — kiln a + (k-1) Ing, — v7") 


i=l 
from which one obtains the maximum likelihood equations 
aL I’ (k) r 
=—r —-rlat Inv; = 0 
ok P'(k) ~. 
Vv; 
éL rk u 


—=-— = 0. 


0a a a? 


The maximum likelihood estimates &, & of k, a are obtained by solving for z 
the equation 


“in T(x) —-Inz = Yinw = n— ¥ w3 (4.1.2) 
this solution is &, then we obtain @ by the formula 

1 
* 


Tables of d/dx In T'(x)—In z are available [1] so that & can be found easily 
from (4.1.2) and &@ from (4.1.3). 


4 
a= 


l r 
— > uw. (4.1.3) 
ve 


4.2. Case of no deterioration and periodic load function 

In the situation considered in 4.1, let us make the further assumption that 
the load function is periodic with period p, i.e., l(t+p) =lU(t) for all t>0, and 
that the time of failure can only be determined as a positive integer number 
of cycles N so that, approximately, S= Np where p is known. Since in this case 


np p 
v -f h(l(t)dt = nc, where c -{ h(l(t))dt 
0 0 


it follows that we can obtain v;=cn; where n; are the observed numbers of 
cycles to failure, and then use (4.1.2) and (4.1.3) to obtain estimates of the 
unknown parameters of (4.1.1). 


5. GENERAL REMARKS 


As mentioned in the Introduction, the family of probability distributions 
given by the Theorem of Section 3 contains the exponential, the Weibull, and 
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the normal distribution as special cases. The first two statements are obvious 
from the remarks in Section 2. The third statement is an immediate consequence 
of the well-known fact that the x? distribution is asymptotically normal as the 
number of degrees of freedom tends to infinity, so that when k is of the order of 
magnitude of 50 the x? distribution with 2k degrees of freedom is already very 
close to the normal distribution with expectation 2k and variance 4k. For 
practical purposes, therefore, whenever the estimate & obtained from (4.1.2) is 
about 50 or more, one may fit the observations with the normal distribution 
which has the expectation 


l r 
ak = — >} v;=5 


P jal 


and the variance 


ak = */k, 


where 6 is the arithmetic mean of the life-length as transformed by considera- 
tions given at the end of Section 4. 

It may also be worth noting that in the case considered in the Theorem of 
Section 4 the actual number m of components has no effect on the distribution 
of life-lengths, and that the critical number k of component failures at which 
the structure fails is the only number that matters. 

As to Assumption A, it may be pointed out that it constitutes an attempt to 
separate the causes of failure into those due to the load function and those 
developing with the lapse of time regardless of the load. This is done by assum- 
ing that the failure rate factors according to (2.5). The causes resulting in de- 
terioration and hence leading to the factor w not being identically constant in 
(2.5) may be such as corrosion, oxidation, or crystalline changes in metal 
structure; our model would not account for situations where such changes are 
appreciably affected by loading. It is likely that Assumption A will hold for 
load functions / such that sup; /(t) is still well within the elastic region for the 
material considered. 

A basic comment must be made on the nature of the load function /. In the 
foregoing it is assumed to be a known function of time, as is actually the case in 
the experiment discussed in Section 6 where / is a sinusoidal function with 
known frequency and amplitude. While this may be a reasonable approxima- 
tion in some empirical instances, it certainly does not comprise such practical 
situations as an airplane component exposed to randomly fluctuating stresses 
in flight resulting from wind gusts, or sudden random variations in peak stresses 
during the takeoff-landing cycle. The present statistical model of the distribu- 
tion of life-lengths can be extended to treat the random case and work is being 
done to develop the necessary theory. The load function / will then be consid- 
ered as a stochastic process, i.e., a random function of time. Modifications of 
the experimental apparatus and techniques are being made so that various 
realistic random load functions of time can be simulated to obtain data for this 
case. 
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6. EMPIRICAL DATA 
6.1. General description of the data. 


A special testing machine designed and constructed by members of the In- 
strument Development Unit of the Physical Research Staff, Boeing Airplane 
Company, was used to obtain data which would lend themselves for testing 
the mathematical model described in the preceding sections. This machine was 
capable of subjecting simultaneously six rectangular strips of material of 
standardized dimensions to repeated alternating stresses. Each of the six strips 
was clamped at the ends and deflected in the center to either side from the 
position of equilibrium, with constant frequency and with amplitudes which 
may be either constant (case of periodic loading) or random with a determined 
distribution. Elaborate precautions were taken to avoid clamping stresses and 
other extraneous damaging effects. The nominal maximum stress in the case of 
periodic loading was computed from the maximum amplitude. 

At the time of this writing, three sets of data have been obtained, each for 
periodic loading with a frequency of 18 cycles per second, and for maximum 
stress of 21,000 psi, 26,000 psi, and 31,000 psi. The material used was 6061-T6 
aluminum sheeting, cut parallel to the direction of rolling. For each of the 
three maximum stresses, 17 sets of six strips were placed in the machine and 
run until all of them failed, thus yielding 102 observations. The machine is 
equipped with a servo-mechanism adjusting automatically for changes of 
amplitude which occur after a strip fails, and with clocks recording the time of 
failure for each strip. Statistical multisample procedures were used for each of 
the three sets of data to test the hypothesis that the six strip positions in the 
machine are equivalent; in each case this hypothesis was comfortably accepted. 
For each of the three series of observations the gamma distribution fitted ac- 
cording to our model was found to yield very good agreement with the empirical 
data. In particular, the remarkably good agreement at the left tail suggests the 
use of the fitted distribution for estimating low quantiles of the distribution 
of life-lengths which, as mentioned in Section 1, are of considerable importance. 
Clearly more experimental work will be needed to justify such practical 
applications. 

As an illustration, one of the three series of observations is presented in the 
next section. All life times in the following tabulations and discussions are ex- 
pressed in thousands of cycles. 


6.2. The 21,000 psi data. 


Of the 102 strips tested one had to be discarded for extraneous reasons, and 
101 observations are presented in Table 159. The estimation procedure of 
Section 4 yields 


k=118, & = 118.76-10'. 


Using tables [8] of the Incomplete Gamma function I(u,p) one finds the fitted 
cumulative probability function 


P{N < 407.95u} = I(u, 10.8). 
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A graphical comparison of the observed and the fitted cumulative distribution 
functions is presented in Fig. 160, which shows very good agreement. This 
is corroborated by a chi-square test of goodness of fit which yielded a x? value 
of 4.49 for 6 degrees of freedom, corresponding to a probability level between 
.50 and .60. 
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TABLE 159 


LIFETIMES UNDER PERIODIC LOADING, MAXIMUM STRESS 
21,000 PSI, 18 CYCLES PER SECOND, 6061-T6 ALUMINUM COU- 
PON CUT PARALLEL TO THE DIRECTION OF ROLLING. 
OBSERVATIONS LISTED IN INCREASING ORDER 








1055 1270 1502 
1085 1290 1505 
1102 1293 1513 
1102 1300 1522 
1108 1310 1522 
1115 1313 1530 
1120 1315 1540 
1134 1330 1560 
1140 1355 1567 
1199 1390 1578 
1200 1416 1594 
1200 1419 1602 
1203 1420 1604 
1222 1420 1608 
1235 1450 1630 
1238 1452 1642 
1252 1475 1674 
1258 1478 1730 
1262 1481 1750 
1269 1485 1750 
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ON THE DISTRIBUTION OF SOLUTIONS IN LINEAR 
PROGRAMMING PROBLEMS 


Harvey M. WaGner* 
Stanford University 


Objection is taken to Babbar’s [1] treatment of the problem of the 
distribution of solutions in a linear programming model. It is argued 
that Babbar’s approximation neglects the mathematical constraint of 
non-negative solutions, and that his general approach to stochastic 
linear programming deals with a restricted (if not unrealistic) set of 
assumptions as to availability of data and the framework for decision 
making. An outline is given of other applications of linear programming 
models in which a policy decision must be made under conditions of 
uncertainty, and the difficulty of obtaining theoretical solutions to the 
associated problems is cited. 


ABBAR [1] has presented an approach for approximating certain properties 
B of distributions of the values for X in the solution of the model 


Model (1) (B + b|)X = (Q+ 8), 


where B is an m Xm non-singular matrix of known constants, b is an mXm 
matrix of random errors, and ¢ is an m-column vector of the corresponding ran- 
dom errors. Also Babbar considers the distribution of the linear form (C+c)’X, 
where C is an m-column vector of known constants, and c is an m-column vector 
of the corresponding random errors. 

Having investigated the approach to the above problem, Babbar then applies 
the results to a general linear programming problem 


Model (2) (B+bX+(G+gWS(Q+60), X,W20 
maximize (C + c)’X + (H +h)'W = y, 


where (G+g) is an m Xn matrix, and W is an n-column vector. Without loss of 
generality, we shall use Babbar’s notation which defines X to be the “active” 
vector in an optimum solution of 


Model (3) BX +GW <Q, X,W20 


maximize C’X + H'W = y; 


by our notational convention, W =0 given the optimum values for X. Paying 
heed to the spirit of Babbar’s paper, we assume that model (2) contains a set 
of coefficients which are to hold for a certain period of time, and model (3) in- 
dicates the expected values for these coefficients.! 

It is the purpose of this note to point out that Babbar’s suggested analytical 
extension is in error,? and the problem he sets out to solve, even if the error 
were corrected, is of limited importance relative to currently unanswered 
questions in this area. 





* The author is indebted to Robert Solow for reading an earlier draft of this note, which was written while the 
author was at Massachusetts Institute of Technology. 

1 More specifically, the expected values of 6, g, ¢, c, and A, are sero. 

2 We shall not discuss the merits of Babbar's approximating procedure in the analysis of model (1). 
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We find it helpful to begin by briefly outlining possible circumstances which 
a user of a stochostic linear programming model might face. The fundamental 
problem of such a policy maker is that he must decide which activities to use and 
what the values for these activities will be. Depending on the availability of 
data, he may make his decision knowing the coefficients in model (2) and/or 
model (3); hence, there are three possibilities we consider. 

Circumstance 1. Babbar’s assumption is that the coefficients in (3) are 
known, the variables X are selected by optimizing with the coefficients of (3), 
and, subsequently, when “current” data is available, the values for X and y are 
determined by the coefficients in (2) (W=0). 

Let us first take note of the error in applying the approach of (1) to the 
problem (2). Babbar’s distributions of 2, the value of the kth variable in X, 
and of y are derived subject only to the constraint that (1) is satisfied once the 
values of B+b and Q+¢ are given. But the solution to (2) must also take ac- 
count of the constraint that X 20; hence, Babbar’s probability functions, con- 
fidence limits, etc., determined without restricting the solution of (2) to feasible 
(i.e. nonnegative) values of X, are thus in their present form not applicable to 
the analysis of (2). 

The assumptions of this case are somewhat peculiar, because they imply that 
a commitment as to the variables must be made before the coefficients in (2) 
are known, but the values for these variables are determined after (2) are 
known. Unless the latter situation does in fact prevail, the approach of “pre- 
selecting” the variables does not seem reasonable, as we shall elaborate below. 

It is suggested that, under this circumstance, we might also want an analysis 
of the question: Let the values for a certain subset of activities in X be those 
determined from (3), and if these values are still feasible in (2), then what is 
the distribution of y, and what is the probability that the solution from (3) 
will be feasible in (2)? Under certain simplifying assumptions as to the subset 
of X chosen and the joint probability distribution of the errors, these questions 
may be easily answered. 

Or perhaps we might want to knew whether there is another set of variables 
X* such that if we solve (2) for the values of X*, or if we use the values of X* 
obtained from (3) (note we allow C*’X*<C’X), then we are maximizing the 
probability of a feasible solution, or the probability that if the solution is 
feasible, y27, where y is some constant. Babbar only mentions the problem 
of considering X* for which C*’X*=C’X, i.e., alternative optima in the solu- 
tion of (3). 

Circumstance 2. The coefficients in both (2) and (3) are known. Several ap- 
proaches are open to the policy maker. One is that he “pretend” the information 
for (2) is not available, and he selects a policy such as one of those mentioned 
under Circumstance 1. For example: Knowing the coefficients in (3), he decides 
to use activities X, and lets W=0. From B+b, Q+<e, and C+c he solves for 
the values of X and uses them, provided they are nonnegative. Under this 
procedure (having defined our course of action when an x turns out to be 
negative), we determine confidence intervals for 2, and y. 

But if all the coefficients in (2) are known, why should the policy maker 
select the activities X? Instead, he should solve for the optimum in (2), which, 
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of course, may not yield the same activities X. In other words, if (2) is solved, 
the optimal basic variables, as well as their values are determined randomly 
depending on the joint probability distribution of b, g, ¢, c, and h. 

We then want a solution for the probability distribution of any variable in 
(2), given that (2) is optimized, i.e., given that the components and values of 
X are selected by optimizing (2) when B+b, G+g, Q+e, C+c, and H+h are 
known. 

Circumstance 3. Only the coefficients of (2) are known and accordingly an 
optimal solution is defined by reference to (2); as in the previous case, we seek 
probability distributions for such solutions. 

We might also pose the inverse problem. By observing statistical data, we 
have the coefficients in (2), and believe there is an underlying model (3) whose 
coefficients are unknown to us; we want to derive confidence intervals for the 
optimal solution of (3). 

In brief, it appears that Babbar has addressed himself only to a limited 
problem in linear programming under uncertainty and even, in this case, the 
formulation is not at present applicable. 

As professionals in linear programming will attest, general solutions to our 
questions are not expected to be readily forthcoming. 
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* Tintner [3] has advanced an ative method (rather than a general analytic solution) for deriving the 
probability distributions. He iders a collection of values for the error terms and solves model (2) for each 
specific set of values. Thus, if there are g exror terms in model (2) and the possibility of each error term taking on 
one of r values is examined, Tintner’s method is to solve r* specific linear programming models, each occurring with 
an associated probability, given the values of the error terms. Thus we numerically approximate the distributions of 
y and z,. For even moderate sized problems, r* linear programming calculations may be prohibitive on present 
day high speed electronic calculators. Dantzig [2] has made some inroads for certain special cases. 











ON RANKING PARAMETERS OF SCALE IN 
TYPE III POPULATIONS* 


K. C. SEAL 
University of Calcutta 


The problem is considered of selecting, from a given set of Type III 
populations with unknown scale parameters, a subset which is most 
likely to contain the population having the largest or smallest scale 
parameter, with the help of a decision rule which ensures many desir- 
able properties. An infinite class of decision rules is shown to possess a 
majority of these desirable properties but among this class only one 
rule is considered to be optimum in the sense that it alone will usually 
minimize the expected size of the selected subset. The optimum rule is 
shown to depend on the extreme order statistic. The usefulness of this 
problem is illustrated by a few examples. 


1, INTRODUCTION 


N MANY practical situations the experimenter becomes interested in a de- 
| cision procedure which would tell him which population or populations 
possess a desired characteristic. For example, he may desire to select from a 
given number of normal populations a group containing the population with the 
largest mean. This problem has recently been discussed by the author [8]. 
Alternatively, he may be interested in selecting a group containing the popula- 
tion with the largest or the smallest parameter of scale. The latter problem will 
be studied in this paper for the Pearsonian Type III Population only. A some- 
what similar problem has recently been studied by David [2] who has essen- 
tially considered modifications of some of the procedures proposed by Tukey 
[10] and Duncan [3] for ranking means of normal populations, when these are 
applied for ranking variances. A somewhat different problem in ranking popula- 
tion variances was also studied by Bechhofer and Sobel [1]. 


2. FORMULATION OF THE PROBLEM 


Suppose there are n+1 populations 


k 
dF; = As e*itig Flr, a->0,k >1,1 =0,1,--:,n, 
'(k) 
with known k, but with unknown parameters of scale a;-', i=0, 1,---, n, 
where z; is a random observation from the 7th population which will henceforth 
be designated as G(a;, k). It is desired to choose a group of populations of size 
v(a) from the above n+1 populations, with the help of a decision rule which 
ensures the following desirable properties: 


A. The probability of retaining the most desirable population, which in our 
case is the population having the smallest a,;-', or in other words, the 
largest a;, in the selected group of size v(a) (not known in advance) is 
maximum among all ((%3}) in number) possible groups of the same 


size v(a). 





* This paper was presented at the 43rd Session of the Indian Science Congress Association held in 1956. 
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B. The least upper bound of the probability of not including the most de- 
sirable population in the selected group is a(0<a<1) whatever may be 
the unknown a;,’s. 

. The probability of rejecting from the chosen group any population not 
having the largest parameter of scale is not less than the probability of 
rejecting the most desirable population. This property will be called the 
“property of unbiasedness.” 

. Corresponding to any y(0<7<1) there exists a constant ao, such that 
the chance of rejecting the population with parameter of scale a) from 
the selected group is greater or less than y according as ap is less or greater 
than doy. This property will be designated the “property of gradation.” 

. The decision rule should have easy practical application. 

. If more than one rule satisfies the above five desirable properties, that 
rule is the optimum which usually minimizes the expected size of the 
selected group. 


A decision rule possessing all the desirable properties stated above is formu- 
lated in the following sections. 


3. A CLASS © OF DECISION RULES 


Let y,, 7=0, 1, ---,, be n+1 random observations from G(1, k) and let 
Ya) <Ya@ < +++ <Ym be n ranked observations among y:, -- - , Yn. The y;’s 
will then define another set of random variables Y;;, 7=1,---, n. It is as- 
sumed that y,#y;, t4j, since continuous distributions are being considered. 


Let Fa(ai, +++, n)(es>O, t=1,-°--, 0, >? c;=1) denote the upper 100a% 
point in the pdf of 


Yo 
F(a, a8 *.» &) = 


n 


>: ci Y «iy 


1 


A class @ of decision rules D(e, «+ +, ¢n)(e:>0, i=1, -- +, n, >o?e;=1) isnow 
formulated as follows: 
“Reject any observation xo from the given observations z;,i=0, 1, - - - , n, if 


Xo 


> F.(ai, +--+, Cn) 


nm 
D etc 
1 


and accept otherwise, where ra) <2~@)< - + - <2) are n ordered observations 
among (21, %2,-**, 2,). Proceed as above for each of the (n+1) observed 
values separately so that each of them takes the place of x» and the remaining 
ordered values play the role of xq), - - - , 2~).” Thus in this procedure we may 
start with the smallest observation among z;, 7=0, 1, - - - , n as 2» and work 
upwards. If any observation is rejected, all other observations larger than this 
observation are automatically rejected. Suppose that the number of popula- 
tions thus retained in the selected group is v(e, - -- , Cn; @). 

For the sake of convenience, we shall denote the decision rule D(e, - + - , ¢a) 
by («) D when c;=1/n, i=1, +--+, n, and by (ii) D(r), 1<r<n when c,=1, 
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e;=0, j¥r. The corresponding auxiliary statistics F(c, - - - , ¢,) will be de- 
noted by F and F, respectively. Clearly F follows the usual F-distribution with 
2k and 2nk degrees of freedom. 


4, EXISTENCE OF DESIRABLE PROPERTIES 


At first it is shown that any rule belonging to class © possesses the four de- 
sirable properties A, B, C, D. The proof of property A depends on the following: 

Lemma 4.1. Suppose that X;, i= 1, 2, are two random variables having pdfs 
G(a;, k), i=1, 2, where a,>a,.>0. Then 


P(X, > X:) < P(X: > X;). 


Proof. Clearly the random variables U;=a;X;, i=1, 2, have the identical pdf, 
viz. G(1, k). 


Hence 
P(U,; > U2) = P(Ui < Us) = 4. 
Also 
P(X, > X;) = P(u; > = U;) 


< P(U; > U2) = P(Ui < U2) 


< P(u, <% U;) = P(X; > X,). 


a2 


Thus it follows that smaller observations are more likely to occur from one of 
the Type III populations G(a;, k), i=1, 2, - - - , having larger a;. Hence the 
selected group of size »(a) which satisfies the property A will consist of 2), 
i=1, 2,---+, v(a), where tq)<2%@)< +++ <2) are n ordered observations 
among 2;,7=1,2,---,n. 

It has already been shown in Section 3 that the group chosen by any decision 
rule D(c:, - - -, ¢n) comprises only the smallest observations, the size of the 
group v(a) being determined so as to satisfy other properties enumerated 
above. 

Proof of other desirable properties is based on the following theorem: 


Theorem 4.1: Suppose that F(a%, - ++, @n%,) is the cumulative distribution 
function (cdf) of n random variables X;,i=1, ---,n, and T(wm,---, Ua) isa 
real-valued function of u,, 7j=1, - - - , m, such that 


T (am, 37S QnUn) => T(%, nie? ¥ Un), (4.1) 
where (a1,--+, a) is a set of positive real numbers and — © <u;<~, 
i=1,---,n. If for an arbitrary constant c, 

P[T(Xi,---,X,) >ela,---, an] 


denotes the probability of 7(X:,---, X.)>c when Xi,---, X, have the 
cdf F(a,X1, - ++, @nXq), th 
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P[T(Xi, «++, Xn) > | ana, +--+, anda] 
> P[T(X,---,X,) >cl|a,---, an). 


P[T(X,, ++, X,) > c| @di,°°"*, nn | 
= P[T(aXi,- ++, a.Xn) > c| M%,--*, an] 
> P[T(X,-: », Xn) >c|a,-- -, an), 


T(a,X1, +++, anXn) > T(Xi,---, Xn) 


by hypothesis. 
From this theorem the following corollaries readily follow: 
Corollary 4.1: If (4.1) is satisfied for all a;>1, t=1,---, n, then 
P[T(Xi, - - - , X.)>c] is a nondecreasing function of each of a;,i=1, +--+, n. 
Corollary 4.2: If 


T(mX;, Ya tnX n) > T(Xi, ete Xx), 


for all values of random variables X;,i=1, - - - ,n, a;>1 and a;>1 for at least 
one i, 1<i<n and if the cdf of T(X;, - - - , X,) assigns a positive measure to 
every nondegenerate interval, then P[T(Xi, - - - , X.)>c] where c is an arbi- 
trary constant, is an increasing function of each of a;, i=1, ---, n. 

Corollary 4.3: Any strictly monotonic function of the cdf of T(Xi, - - - , X,), 
which satisfies the conditions of Corollary 4.2 is an increasing function of 
a;,,t=1,-°-+,n. 


Proof of properties B, C and D depends on the following theorem: 
Xo 


g ii F.(c, ie * Cn) 
DL eX «iy 


i I 


is a decreasing function of each of d;=ao/a;, i=1,--+-, n. 


Proof: The proof is similar to that of an analogous theorem (Th. 3.2.1, p. 391 
of [8]). Define X;=aoX;, i=0, 1, - - - ,n. Then noting that P[ 50? ¢:X,’<)] 
is a decreasing function of d;=a)/a; by corollary 4.2 and that the distribution 
of Xo’ does not involve the d,’s, the above theorem readily follows. 

For proving property C let us assume that a,o) > - - - >a;n) denote the re- 
ciprocals of scale parameters of the given n+1 Type III populations when ar- 
ranged in a decreasing order. By our decision rule D the probability of rejecting 
the desirable population G(a,o), k) will depend on (4.2) 


@a) An) 
Paya ore wig ’ 
(0) (0) 
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which is defined as the conditional probability of 


| 


a AY cy < yo/Fa(ti, + + +, Cn) 


1 


when yo is assumed to be held constant. Here Y;, 1=0, 1, - - - ,n are defined as 
in Section 3. The probability of rejecting any undesirable population G(a,,), k), 
i=1, 2,---, mn, will on the|jother hand, involve 


ao, Wa) Qui-1) Aci+1) A(n) 

Payevsg(aes $2, eas ’ ——y ree, ' (4.3) 
Quip Wi) Qi) aii) ai) 

Comparing the arguments of P,,,...,., in (4.2) and (4.3) we notice that 

a(3) Q¢3) 


—s 


a\i) (0) 





and 


a (0) aii) 
} en, Me, 


(i) a.o) 


where 7=1,2,---,n,j=1, | - ++ ,n;j#7. Thus we can make a one-one cor- 
respondence between the n arguments of P.,,...,., in (4.2) and (4.3) in such a 
way that no argument of P.j,...,., in (4.3) is less than the corresponding argu- 
ment of P.,,...,, in (4.2). Hence from the monotonic behavior of P..,....,., 
(6, -- +, 6) with regard to $;,7=1, -- - , n, it follows that the probability of 
rejecting any undesirable population G(a,), k), i=1, - - - , m, is never less than 
the probability of rejecting the desirable population G(a,o), k). This proves the 
property of unbiasedness possessed by our decision rule D(c;, - - - , en). 

It can also be shown in an analogous way that under any rule D(a, - - - , cn), 
the probability of rejecting thle population G(a;, k) from the given n+1 popula- 
tions is greater than the probability of rejecting another population G(a;, k), 
j#i=0, 1,---,n, from the|group, if a;<a,;. From this result an alternative 
proof of property A readily follows. 

It may be noted that none pf the arguments of P,,,...,., in (4.2) is less than 1 
and so the expression in (4.2) jwill not exceed P.,,...,-, (1, - - - , 1). This implies 
that the probability of rejecting the desirable population G(a,o), &) will not 
exceed the desired significance level a(0<a<1). Hence a will be the least 
upper bound of the probably of incorrect choice (i.e., of not including the 
population having ao) in the selected group). This proves the validity of 
property B by our decision rule D(a, - - - , C,). 

Proof of Property D: 

From Theorem 4.2 it follows that 
Xo 


P > F.(er, > ++, Cn) 


> ¢:X (4 
1 


is a decreasing function of ao; |when ao—0, the value of (4.4) is equal to 1 and 
when ao>— ©, the same value is equal to 0. It is easily seen that (4.4) is a con- 


(4.4) 
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tinuous function of ao. Hence corresponding to any assigned value y(0<+y< 1) 
of (4.4) there exists a particular value ao, of a» for which (4.4) is exactly equal 
to y. The value ao, will clearly in general depend upon a, dz, « - + , Qn, Ci, * * * 4 Cn 
besides the assigned value y and it follows that 
Xo 
P agian > Fale, +++, Cn) 27 


z CX i) 


1 
. < 
according as dp=apy. 


CONSIDERATION OF PROPERTY FE 


The rule D(c, - - - , ¢,) is quite simple to apply provided F, (ci, - - - , ¢n) is 
known. Evaluation of this value for any possible choice of (c, « - - , ¢n) is quite 
difficult. Since in the particular case when c;=1/n, i=1, - - - , n, F obeys an 
F-distribution with 2k, 2nk d.f., the 100a% point of this statistic can be readily 
obtained from the known F-Table. Evaluation of 100a% point of F,, 
r=1,---,m will not be very difficult particularly when r=1 and n. Approxi- 
mate evaluation of these values for F,, r=1, n is possible from Hartley & Fin- 
ney’s [6], [4] results to be discussed shortly. Thus the rules D, D(1) and D(n) 
may be considered to possess the above mentioned property Z. Our next prob- 
lem is to select one rule among these three rules from a consideration of the 
property F. It is expected that the rule thus selected will also be optimum 
among the entire class @ of the proposed decision rules. 


SELECTION BASED ON PROPERTY F 


For the present we shall denote the given group of population G(a;, k), 
i=0, 1,---,n by G. It is easy to see that under any rule D(c, - - - , c,) the 
expected size of the retained group selected from G for a given a@ is equal to 


(n+1)— (4.5) 


* “pone of rejecting G(a;, k) from "a 


under D(c, - - -, Cn) at the level a 


i=0 


That rule among D, D(1) and D(n) will be the optimum for which.the sum in 
the expression is maximum. By property B the chance of rejecting the best 
population (i.e., that population which has largest a;) from the selected group 
is always bounded by a, which will in practice be a small quantity such as .05 
or .01. So the major contribution to thé sum in (4.5) will be from the probabili- 
ties of rejecting nonbest populations from the selected group. Hence the rule 
which will usually maximize the chance of rejecting most of the nonbest popula- 
tions from the chosen group should be considered as the optimum. Some argu- 
ments are now put forward to show that the rule D(1) may be considered as the 
optimum from this point of view. 

The close analogy between the proposed decision rules and the usual theory 
of tests of hypotheses is quite evident. In fact, the decision procedures D, D(1) 
and D(n) can be tied up with the well-known problem of tests of hypotheses as 
follows: 





| 
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The hypothesis H» to be tested is Hy: The parameter a= [ao, ai, - - - , @n]’ 
belongs to a subset /\ of the parameter space of ain which ap>a,;,i=1,-- -, 
n; that is, 


oc A = 1 (a > 02, 
1 


and the alternative is 

H,: a belongs to the complementary subset A so that AUA comprises the entire 
parameter space of n+1 dimensions. Clearly A contains all parameter points 
for which ap is not the largest. Thus H» may be taken to correspond to the best 
population, whereas H, is linked with nonbest populations. The critical region 
corresponding to the decision rule D may be defined as the subset # of all sam- 
ple points z= [xo, 11, - - - , 2,’ for which nzo>F, >? 2,, that is, 


NZo 
EP 

1 

n 
— >? 

> diy; 

1 
where di=ao/a;, i=1, 2,---,m and Y,’s have the same meaning as defined 


in Section 3. 
Thus the probability of rejecting the best population under D is 


> F. 


w: 


nYo 
LY: 
1 


by Theorem 4.2, so that a may be considered as the significance level. Further 
the probability of rejecting any nonbest population will be 


Y 
P(xew| H.) = P| ——"— > F. | = P (say), (4.8) 


> di¥; 
1 


where at least one d; is less than 1. Thus P corresponds to the power of the test 
when H, is true. 
Likewise corresponding to the decision rule D(1) the critical region w(1) will 


be given by 
x 
wi): {= > Aah 
21) 


[E> n 
zi 


[= > aFi.|, 
Yi 


P(acew | Ho) < P > F.| =a, (4.7) 
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where F;, denotes the upper 100a% point of F;. Clearly, 
n Yo 
P[xew(1) | Ho] < 1 - P| Nn (> < Fis) | = a 


1 ‘ 


P, = P(zew(1)| H.) = 1 - P[ n (: < iF.) | 


1 rY 


denotes the power of this test when at least one d; is less than 1. 
The critical region w(n) corresponding to the rule D(n) will be 


w(n): 12 > Fan} 


Zn) 


A (: > Fae) 

1 \%y 

n (C > d.Faa), 
1 We 


where F,,. denotes the upper 100a% point of F,. 
It is easy to see that 


P(xew(n) | Ho) < al n (= > Fae) =a 


1 


‘ 


and the power P, of the test corresponding to D(n) is 


; P| A (F > dF se) (4.14) 


1 ‘ 


with at least one d;<1. 

From our previous arguments it follows that the rule which will have maxi- 
mum power for most of the points of the parameter space will be the optimum. 
Since an exact evaluation of these power functions P, P; and P, for arbitrary 
(di, -- +, d,) is extremely difficult, we shall study the behavior of the differ- 
ences P;—P and P—P, for certain subsets of the parameter space of 
(di, «+ - , dx) as indicated below. 

We shall consider the following four different situations. Let “min” and 
“max” denote minimum and maximum respectively. 


I. d,=min (d,, - - - , d,) increases indefinitely, (d,:, - - - , d.) remain finite, 

whereas d,,,:=max (d,i:, ---, d,) decreases indefinitely, where r and s 
are any two integers r<s lying between 1 and n. 

II. d,=min (d,, -- - , d,) increases indefinitely, but (d,i1, ---, d,) remain 
finite, where 1<r<n-—1. 

IIT. (d:, +--+, d,) remain finite and d,,:=max (d,i:,---, da) decreases in- 
definitely, where 1<r<n—1. 

IV. d= --- =d,=d (say). 
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By (4.8) and (4.11) we get 


~ Y as e Y 
ee ee Se. 2 P| n (~ < dF). (4.15) 
Ms 1 \Y; 
> ay; 
1 


Clearly for case I, P:— P-1 as d,—~ and d,4:—0; for case IT, 


nY J » /Y. 
p . ? > a... ¢ J, - P| AC <a.) 


r+l 


n Y 
1 P| a (- < dF sa) 
r+1 Y; 


as d,— «. It is evident that this limiting difference will be quite large for large 
n, small r, large a or small d;, i=r+1, ---, n. For case III, 


‘ 


which tends to 


de nYo 
P, — P— P| —— < F,}, 


> ay; 
1 


as d,.,—0. This value will be large for large s, small a or large d;, i=1, - - - , s; 
as for instance, with k= 10, n=4, s=3, a=.03 and d; =d,=d;= 1.5, the limiting 
difference evaluated from tables for incomplete B-integrals is about .99. For the 
remaining case IV it is easily seen that the difference P,— P=0 for d=1. 
Further, P:— P—0 as d—0 or ~, and so it is expected that this difference will 
be small for all d. This difference can be approximately evaluated by consider- 
ing ¥o/Y;, i=1, - - - , n, to be approximately independently distributed as was 
suggested previously by Hartley [6] and Finney [4] in a similar situation for 
moderately large k. Hence it follows that 


n Y ¢ n 
P| n G. < aF,.) |= [P(> < dP.) ; 
1 Y; Y; 


Since Y,/Y; follows an F-distribution with 2k, 2k degrees of freedom we can 
utilize this relation for approximately evaluating F,. for various a, n and k 
by taking d=1. Thus for a=.028, n=4 and k= 10, F;, is found to be approxi- 
mately equal to 3 from Pearson-Hartley’s Table [7]. For this case IV, 


P, — P= P[Pun < dF.| — {P[Pu.x < dFie]}", (4.16) 


where F,,,,, follows an F-distribution with (, v2) degrees of freedom. The ap- 
proximate difference can be easily evaluated for various a, n, k and d with the 
help of Tables for incomplete B-integral or Pearson-Hartley’s Table [7]. Actual 
calculation with 2<n<5, k=10,0<d<3 with varying a shows that this differ- 
ence (4.16) is very small for these combinations. 

Thus it is expected that P;—P will be positive in most of the points in the 
parameter space and P, will be superior to P by a considerable margin in the 
three cases I, II and III considered above. 
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We shall now study the difference between P and P,. By (4.8) and (4.14) 
we get 


Y n Y 
anaes Sy a P| n (>> Fea) | (4.17) 
1 Y; 


> dy; 
1 


Clearly for the cases I & II, P—P,-0 as d,—~ ; for the case III, 


= Y - s /Y 
oe oe Ye > - | (=> ara.) | 
> dy Beet 
: + + 


which tends to 


Y, - -/Y 
ee oe al n (=> Fea) | 
1 i 


> ay; 
1 


as d,,,—0. This limiting difference will be fairly large for large n, small s and 
d,’s (i=1, - - - , s) lying in the neighborhood of 1. For the case IV, assuming 
approximate independence of Y»/Y;, i=1, - - - , n, it can be shown as in the 
previous case that the approximate difference 


P — P, = P[Pex2n > dFal] — {P[ Fx > dFral}* 


is very small practically in all cases. Thus from a consideration of these four 
somewhat limiting cases it is seen that P, is likely to be inferior (although some- 
times slightly) to P in almost all such situations. The close similarity of these 
results with those derived in an analogous paper (Seal [9]) is quite obvious. As 
was shown in that paper that the conclusions drawn from such limiting cases 
hold good for moderate values of d;’s also, so it is expected that the above 
superiority of D(1) over D and that of D over D(n) will be maintained for 
moderately large or small values of d;’s. Hence D(1) may be considered as su- 
perior to both D and D(n) in almost all points of the parameter space, so that 
the expected size of the group selected from G under the decision rule D(1) is 
likely to be the smallest. Thus the rule D(1) alone may be taken as satisfying 
the property F besides the other five desirable properties A, B, C, D and E. 

Hence D(1) may be considered as the optimum rule for the present problem. 
The close similarity between this optimum rule with the rule proposed for 
ranking means of normal populations (see [5], [9]) is at once evident. 

Likewise it can be shown that, if our problem were to select a group of 
populations which contains the population having the smallest parameter 
among the a;’s, the decision rule of the D(n) type will be optimum among the 
entire class of such decision rules. From these and other similar results ([5], 
(9]) derived in case of ranking means, it appears that among the proposed type 
of decision rules the decision rule of the D(n) or D(1) type will be the optimum 
for selection of a group of populations (other than Normal or Type IIT), ac- 
cording as the population having the largest or the smallest location or scale 
parameter is consicered as the “best” population. 
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As mentioned earlier, for any given a, n and moderately large k, the approxi- 
mate values of F;. can be easily obtained from the relation 


P[ Fox.» < Fie] => (1 — a)", (4.18) 


which can be evaluated from Tables for incomplete B-integral or from [7]. 
When a is small, the expression on the RHS of (4.18) is approximately equal 
to 1—(a/n) so that F;. approximately equals the upper 100 (a/n)% point of 
an F-distribution having (2k, 2k) degrees of freedom, provided n is not too 
great. 

5. ILLUSTRATIONS 


1. Suppose there are n+1 normal populations N(y;, ¢;*),i=0, 1, 2, ---,n, 
with unknown means and variances and that p+1 random observations z,,, 
i=0,1,---,n;j=1,2,--+-,p+1, from each of the n+1 normal populations 
are given, where 2;; is one of the p+1 observations from the ith population. 
Obviously 


pti 
psi? = Do (xy — 2), 


j=l 


p+l 
i; = > xi;/(p + 1), 
j=l 
obeys the Type III population G(1/2e,;7, p/2), =0, 1, - - - , n. Hence the above 
decision rule D(1) can be applied to ps,*, or simply to s,’, if it is desired to choose 
a group from the above n+1 normal populations which satisfies all the six de- 
sirable properties. 

To illustrate the optinum decision procedure D(1) suppose that it is required 
to make a selection between five machines for their suitability in producing a 
certain dimension on a new product as uniformly as possible. Mean square 
estimates of variances based on 21 articles from each machine, when arranged 
in an increasing order, were found to be as follows: 








Machine A B Cc D E 

















a? | .30 54 -80 1.64 3.80 





A group of machines showing minimum variability has to be chosen from the 
above five machines with some specified risk a for not including the best ma- 
chine in the selected group. 

Let a=.02% Here n=4, p=20, and it is easy to verify that F,.=3.00 
(approx.) and F,=1.90. Under the decision rule D(1), it is readily seen that 
the machines A, B and C are selected ; however, under the rule D the retained 
group comprises A, B, C and D. 

The difference in the size of the retained group by D(1) and D becomes more 
marked in this example for a larger a such as .55. In this case F;. is approxi- 
mately equal to 1.50, whereas F, =.95. The selected group under D(1) consists 
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of A alone, whereas the group retained under D comprises A, B and C. The su- 
periority of D(1) over D is quite evident here. 
2. Let there be n+1 populations 


1 
f(z, B;) = e 7 (eBi)—-o op i= 0, 1, sD, 


I(r) 


viz., the limiting distributions of observed rth extreme values having £,, 
i=0,1,---,m as their parameters of location. It is desired to select a group of 
populations among the above n+1 populations which contains the population 
having the maximum location parameter with some specified risk, when one 
observation z; is chosen randomly from the ith population. Suppose a;=e*, 
i=0, 1,---,,s0 that maximum location parameter among 8;’s corresponds 
to the maximum value among apo, a, - - - , aq. It is easily verified that the pdf 
of y=e~* arising from the ith population is given by G(a;, r), t=0,1,---,n, 
so that the decision rule D(1) possessing all the desirable niaiiion: may be, 
applied to the transformed variables y’s in this problem. 
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USE OF RANDOMIZATION IN THE INVESTIGATION 
OF UNKNOWN FUNCTIONS* 


Rosert Hooke 
Westinghouse Research Laboratories 


When one observes values of a function, he usually expects to make 
inferences from these values to other values, or to characteristics of the 
function as a whole. Unless useful assumptions about the function can 
be made, one can make such inferences only if randomization has been 
used. Two problems, that of numerical integration and that of measur- 
ing goodness of fit, are discussed from the randomization viewpoint. 


1, INTRODUCTION 


to show that they can make valuable contributions, but these contributions 
have consisted, to a great extent, of the application of techniques developed 
for the biological sciences. Though it has often been pointed out that there are 
fundamental differences in the experimental problems of these two kinds of 
science, much remains to be done in locating these differences and doing some- 
thing about them. It is the purpose of this paper to examine one of the areas 
(the study of unknown functions) where work needs to be done. 
In the physical sciences, quantitative factors occur more frequently than in 
the biological sciences. In fact, it is often possible to describe an experiment in 
the physical sciences as the investigation of a function 


Se eschaven methods have been used in the physical sciences long enough 


y = f(x, ie * 4 In), (1) 


where the “response” y and the factors ; are all real variables. The methods of 
classical analysis of variance which apply to qualitative factors apply equally 
well here. One would think, however, that study of the case (1) would result 
in the development of new techniques (as it in fact already has, in the optimiza- 
tion problem) for this case, and perhaps also in new ideas fer the clarification 
of problems where only some of the factors are quantitative. 

In planning an experiment to investigate a function such as (1), one should, 
before rushing in with factorial designs, consider (i) the nature of available 
prior information about the function and (ii) the kind of information about the 
function that is being sought. This statement is obvious, but it is made to point 
out an area of possible research which has received too little attention. 

With respect to prior information we have at present the following three 
possible situations: 


(a) The form of the function may be assumed, so that the whole problem 
is one of estimating parameters. 

(b) The form is unknown, but other assumptions can be made which are 
useful in planning the experiment. 





* Paper presented in part at meeting of the Institute of Math tical Statistics, Princeton, N. J., April 20, 
1956. 
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(c) The only possible assumptions are such that we cannot (or do not know 
how to) use them in our planning. 


Problems of type (a) have a venerable history, and will not be considered 
here. Insofar as the author is aware, statisticians have contributed little toward 
problems of type (b). One needs here a classification of functions in terms that 
are useful to the experimenter. This should probably transcend the traditional 
mathematical classification (based on continuity and number of derivatives), 
since experimentation is usually discrete and finite. The categories in the 
classification should be such that an experimenter is likely to be able to assume 
that his function belongs to one category rather than to another, and such that 
knowledge that the function belongs to a particular category will actually be 
helpful in designing the experiment. 

Until some work is done on the problem of classification, those problems 
which are not of type (a) will tend to be of type (c). In this third case, the con- 
tribution of the statistician should be to point out, and make use of, the follow- 
ing fact. If a function is observed at a set of systematically chosen locations, 
then only by making assumptions about the function can we make inferences 
other than to its values at these chosen locations. Hence, in problems of type 
(c), if we wish to make general inferences about the function, as we nearly al- 
ways do, we must select our locations in some random fashion. 

In this paper we are concerned with the problem of making inferences from 
values observed at randomly selected locations. As indicated above, what we 
do should depend on the kind of inference we expect to make, and two kinds of 
problem are considered. The first problem (Sections 2—4) is that of numerical 
integration, which involves essentially the estimation of the average value of a 
function over a given domain. The second problem (Section 5) is the goodness- 
of-fit problem, which pervades much of experimental work in the physical 
sciences, and for which the statistician has not provided very much in the form 
of really useful results. 


2. NUMERICAL INTEGRATION. SYSTEMATIC AND RANDOM ERRORS 


We wish to estimate the value of an integral, say 


b 
Ie f fla)dz, (2) 


where f(x) is an unknown function for which values can be found experimen- 
tally. (The discussion below will also be relevant to the case where f(x) is a 
known function but cannot be integrated in closed form.) Results obtained 
below are easily extended to functions of more than one variable. 

A common procedure of numerical integration is to observe the function at 
n+1 equally-spaced locations 1» (=a), 21, 22, - ++, 2, (=b). The integral is 
then estimated by, say, one of the “Newton-Cotes” formulas. This series of 
formulas (see, for example, reference [1]) begins with the well-known trape- 
zoidal rule and Simpson’s rule and extends through more complicated formulas 
based on approximations to f(z) by polynomials of increasing degree. Any of 
these formulas can be written 
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n n 
t= (b-—a) D ky, where Dk; =1. 
i= t=O 


Here f stands for an estimate of J, and we see that [ is given as a weighted 
average of the observed ordinates y;. The various formulas in the series differ 
only in the choice of weights. 

In order to discuss the error involved in approximations of the above type, 
we let 


Y; = f(z) 
and 
y= Y;+ &, a=0,1,2,---,n. 
Here the «; represent the random observational errors. We then have 


total error = [ — J 
=S+R, 


where 
S=—I1+(b-—a) ¥ kY; 
t=O 
is the bias, or systematic error, and 


R = (b—a) ¥ ke; 


t=O 


is the random error. The error S, usually the only one mentioned in textbooks 
on numerical analysis, represents the effect of the difference between f(x) and 
the polynomial on which the particular estimation formula is based. The error 
R is the effect of experimental error. Assuming, as we shall henceforth do, that 
the «’s are uncorrelated and have a common variance a’, we find that 


var R = (b — a)*o? > k?. 
i=0 


In practice, the value of S cannot be found, though if certain information 
is available about the higher derivatives of f(z) in the interval (a, b), an upper 
bound can be provided for S. (See [1].) This bound tends to diminish as we use 
the more complicated formulas based on polynomials of higher degree. On the 
other hand, since the k; are subject to the restriction that their sum is unity, it 
is clear that var RF is at its minimum when the k;, are equal, and that it increases 
when the variance of the set of k; increases, which is just what usually occurs 
when we attempt to reduce S by using formulas corresponding to polynomials 
of higher degree. 

Thus, when o?+0, attempts to reduce S generally result in an increase in R. 
It follows that the most complex Newton-Cotes formula that we are willing to 
use, though it may minimize the bound on S, does not necessarily minimize the 
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total error. When o? is comparatively large, we should use a simple formula, 
such as the trapezoidal rule. In practice, we may know a bound on S and have 
an estimate of o*. Since these are essentially different kinds of quantity, there 
is no clear way to use them to minimize S+R, but we can nevertheless deter- 
mine, from the orders of magnitude of S and R, approximately which formula 
should be used in any case. 


3. THE USE OF RANDOMIZATION TO OBTAIN UNBIASED APPROXIMATIONS 


In experimental work it is unlikely that there is available the knowledge of 
higher derivatives of f(x) that is necessary to place a bound on S. Even when 
such knowledge is available, it may provide a very poor bound. Statisticians 
have long used randomization to eliminate troublesome biases, and the principle 
can be used in numerical integration as described below. 

We first divide the interval (a, b) into p non-overlapping subintervals B,, 
where 


length B; = Aj, t= 1,2,---,p, 


and so 
P 
A=) A,;=b-a. 
t=] 


These subintervals may also be called “blocks” or “strata.” Inside each B; we 
select g points at random, i.e., we consider x to be a random variable uniformly 
distributed over B; and select from this distribution a random sample of ¢ 
“locations” 


Li, Ti2, * * * » Vig. 


At each location we make r observations, obtaining a set of observed quantities 
Yisk, where 
i= 1, 2, OF 
Yk = Ye ten j=1,2,---,@ (3) 
k=1,2,--+-,r. 
The notation here is as in Section 2. That is, Y;;=f(z,;), and the e; are un- 
correlated errors with mean 0 and variance o*. 


We define yu, and:c,* to be the mean and variance of Y;; (for uniformly dis- 
tributed z;;) over the interval B;. That is, if 


I;= f(x)dz, 
B; 


then 
ui = 1,/A; 
and 
1 
o2 =— |] (f(x) — ui]*dz. 
Ai B; 
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It follows that 
" A; 
T= —D vin 
Qr ik 
is the minimum-variance linear unbiased estimator of J;, and we shall use 
T= D1 
i 
A; 


> — Vise 


ijk «QT 


as our (unbiased) estimator of I. 
The following linear model provides a basis for analysis of variance: 





Yise = w+ BiH Oia) H Cage. (4) 
Here 
uw = mean of f(x) over (a, b) 
l 4 
= Be -f f(x)dx 

Bi = wir 

$53) = Yi — (u + Bi). 
We have 


“ A;? 
var J; = —— var D> vin 
q’r* jk 


A;? 
= — (ro? + 0°), 
qr 
and so, since f is the sum of the independent /,, 
a 1 
var 7 = — >> A?*(ro,? + 0°). (5) 
qr i 
Suppose now that we carry out an analysis of variance on the numbers 
y in = (pAi/A) yin, 
where A=b—a. Then if we let 
1 


va = @-Da Ls — 1)’, 


1 
0? = 7 X A,*o,?, 
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and 
1 A\? 
mee ay em 
(p — 1)A? Pp 
we have the results shown in Table 181. 


TABLE 181 
ANALYSIS OF VARIANCE 








Sum Degrees 
Source of Estimate of of Expectation of Mean Square 
|Squares| Freedom 





Between blocks SSB | p-1 o*{1+p(p —1)04*) + proy?+p*qroa? 
Between locations, within blocks | SSL | p(q—1) |e*{1+p(p—1)ea*]+pro? 
Within locations SSW | pq(r—1) |e*[1+p(p—1)ea?] 














In this table the sums of squares are as usual, i.e., 
SSB = gr Di (y':.. — y’...)? 


SSL =r >> (y's. — y'x..)? 
ij 


SSW pe (y’ se — y’.)?. 


ijk 
In the particular case where the A; are equai, then of course o,?=0 and the 
expectations of mean squares become the same as in an ordinary nested- 
classification analysis. 
If we let 


stand for the mean square corresponding to SSL, it is clear that 


A? ¢ 

E(— MSL) = var 7, 
pqr 

and this is the chief reason for having done the analysis of variance on the 

y’ ix. An unbiased estimator of ¢? is provided by the last line of the table, 

although a better estimator comes from the corresponding expression for the 

Yisk- 

In this section we have provided an unbiased estimator, /, for the integral J, 
and we have just developed an unbiased estimator for the variance of 7. Since 
T is a weighted mean of pgr observations, it will generally be approximately 
normally distributed when per is large; hence in this case we can compute 
approximate confidence limits for I. 

In the next section it will be shown how the results of Table HO can be used 
in planning further experimentation. 
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4. REDUCTION OF ERROR 


Given a total number, say N, of observations, there is always the problem 
of how best to distribute them. This is a well-known problem in stratified 
sampling, and what we shall have to say in this section is simply the usual 
thing as applied to the integration problem. 

We consider first the number r. It is conceivable, of course, that r could be a 
function of i and j (i.e. of block and of location within block). There is no visible 
advantage in allowing this to happen, since, after the first observation is made 
at each location, every additional observation supplies the same additional in- 
formation about o*, regardless of where it is taken. Hence, since different r’s 
would serve only to complicate the analysis, we take them to be equal. It is 
clear from formula (5) that, for a fixed gr, we should take r as small as possible. 
Thus we should usually take r=2 (the minimum number if we wish to estimate 
o*) unless for some reason we want to spend more degrees of freedom on the 
estimation of o°. 

We can come to the same conclusion about gq, as follows. Suppose we estimate 
I in two ways, first using only one interval with g locations, and second, allot- 
ting the g locations to two subintervals. If V; and V; are the respective variances 
of the two estimates so obtained, and if we can insure that Vi:> V2, then we 
have shown that g should be made as small as possible. To do this, we suppose 
that the two subintervals receive q and gq: locations, respectively, with q:+4¢: 
=q, and that 


A; = qiA/q i= Zk 2. 


Then we have 


A? 
— (ro + o*) 
qr ° 


A? 
= —(% or + Sarto), 
qr\q q 


1 L) 
of? —f [f(z) — w}*dz. 
Ava 
In order to show that Vi>Vs2, then, it is necessary only to show that 
a 
oj? > he | 2 + & o:?. 


This result follows from application of a little algebra to the definitions of 
o,*, o1’, and 27. 

It follows from the above discussion that we should take q as small as possi- 
ble. Hence we would normally take q=2, this being the smallest value permit- 
ting an estimate of var [. 

If N is fixed, a choice of g and r determines p. Hence the only remaining 
choice is in the lengths A; of the p blocks B;. Since 
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l 2 
var f = — > A;*e/? rsa > A’, 
a qr i 


it is clear that when o? is large our chief problem is to minimize the second term, 
which we can do by taking the A; to be equal. Otherwise not much can be said 
unless we have knowledge about the function of a type which is not usually 
available. If the function is linear, o,* is proportional to A,’, so again the best 
subdivision is that which makes all A; equal, but this is a case of no great in- 
terest except insofar as we may be dealing with a function which is known 
to be approximately linear. 

In case one experiment has been run, we may draw conclusions about how to 
run a second experiment. The two terms in 


1 a 
var T ae “to > Ato? + ~~ > A;? 
q i qr i; 
are estimated, respectively, by 


A? 
-— (MSL — MSW) 
pqr 


and 
2 


A 
— MSW. 
pqr 


If the latter is much larger than the former, we can expect to reduce var f 
materially only by increasing the total number of observations or by somehow 
reducing the experimental error. If the former is much larger than the latter, 
we should look for the possibility of obtaining a smaller var f by taking more 
observations in those blocks where o;? (estimated) is large, or by subdividing 
these blocks into smaller ones in which we continue to take q observations per 


block. 
5. MEASURING GOODNESS OF FIT 


In a great deal of experimental work in the physical sciences, a theory is 
available for prediction of the response y as a function of n variables x, 
Xe, ** +, ». An experiment is performed to see if the theory is valid. Signifi- 
cance tests, so often used in this area, do not provide what is wanted at all; a 
small sample is almost sure not to give a significant result, while a sufficiently 
large sample will almost always show a significant difference between theory 
and fact. The experiment whose end result is a significance test can do little 
but support the belief, prevalent in some quarters, that statistics is a nega- 
tivistic science. 

Decision theory goes one step further, in that it takes into account, via a 
loss or risk function, the cost of adopting a theory as a function of the amount 
of discrepancy between theory and obssrvation. The author, however, does not 
accept the point of view that all problems are decision problems, except per- 
haps in a trivial sense. Frequently an experiment is performed in order to pro- 
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vide a basis for a decision made by someone other than the experimenter, or 
even for a number of future different kinds of decision to be made by the ex- 
perimenter or by others. The problem of the experimenter, then, at the time 
he summarizes his results, is not to make a decision, but to describe his results 
in such a way as to facilitate decisions made at a later date 

What is needed at this point is a way of estimating some measure of the 
discrepancy between theory and fact. Then if one wishes to use the theory in 
some series of computations to arrive at a prediction, he can take the estimated 
discrepancy into account in estimating the accuracy of his prediction. Clearly 
the usefulness of the theory will depend, not only on the amount of discrepancy, 
but also on the nature of the series of computations. Hence at the time of the 
performance of the experiment, it may be meaningless to decide whether the 
theory is “close enough” or not. 

Suppose now that f(x) and h(x) are, respectively, the unknown “true” func- 
tion and the theoretical, or hypothetical, function, each defined over the in- 
terval B=(a, b). The “distance” between the two functions over B will be 
denoted by d[f(x), h(x); B], where 


1 h 
a*[f(z), h(x); B] = —~—f [f(z) — h(x) Pax. 


In order to estimate this distance in any given case we proceed to take ob- 
servations as we did in the integration problem. That is, we divide B into p 
subintervals B;, select g random locations in each B;, and take r observations at 
each location. We suppose that the observation y;; made at location 2z;; can 
be written 


$= 1,2,°--,p 
Yise = Vig + ese, j=1,2,---,¢@ 
k=1,2,-++,r 


where Y,;;=/(2;;) and the e’s are uncorrelated and have zero mean and a com- 
mon variance o?. Our particular interest is in the quantities 


Zk = Yair — h(zis), 
which we can also write as 
Zi = Liz + ein, 
where Z;;= Y4;—A(2i;). 
We shall write 

d? = d*[f(x), h(x); By 
and 

d;? = d*[f(x), h(x); Bi). 


Since the z,;; are random variables, so are the Z;;, and as d;* is the mean of 
[f(z) —h(z) }?, it follows that 


E(Z?;;) = d;?. 
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We shall use the following notation: 


T% = E(Z4;;) 
Q = E( Ain) 
A; = length of interval B; 


A= >A; 
tise = Aiz ine 


We have 


dad? 


1 
i 3 a Ad,’ 
A % 
and so it is easily shown that 
ra 
E| — bi tije = q? + o. 
grA ijk 
Hence if é? is any unbiased estimator of o?, we have in 
1 
d? = — Dit - 
qrA ijk 


an unbiased estimator of d*. We shall take 





1 
= —_—_— > (Yaje = ys.)*, 
pair — 1) im 
so that 
9 1 . 
Penis Be ten = sched) Btls 
qrd ‘<x pqa(r — 1) i 


It is only a matter of some lengthy algebra to show that 


1 4 2% —1 
var d? S oes > ae(x — d; -b ee od.) + ot P o472(Q am o*), 
gh? ; r par(r — 1) qr 





where o,? is as in Section 4. 

We next perform an analysis of variance on the numbers ¢,;.. Results are 
summarized in Table 186. 

In the notation of Table 186 we have 


RO Oe. See ae ae 
l ———_ + — (p- —¢ , 
gra? par(r—1) gr ‘ 





var d? = 


Table 186 does not provide a simple estimator for var d? except in the very im- 
portant special case where 


Q = 3e%, 
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which is the case when the ¢’s are normally distributed, and where 

r=2, 
which is a likely choice for r for the same reasons advanced in Section 4. In 


this case, 


Pp 
var =~ (J + K + M) 


~- 


p 
2qA? 





E(MSL), 


where MSL is the mean square corresponding to SSL. Hence, when r=2 and 
Q=3c', we have in 


p 
2qA? 





an unbiased estimator of var d@?. 


REFERENCE 
[1] Milne W. E.. Numerical Calculus. Princeton: Princeton University Press, 1949. 


TABLE 186 
ANALYSIS OF VARIANCE 











Source of Degrees of | Expectation of 
Estimate Sum of Squares Freedom Mean Square 
Between blocks SSB=gqr >> (t..—t...)? p-l 
Between locations, SSL=r > (ti. —ti..)* p(q—1) J+K+M 
within blocks ii 


Within locations SSW = > (tin —tij.)? pq(r —1) J+K 


ijk 
2 
J wie > avd? 
Ps 


—o 





x-2 [p(p —1)o42+1] A? 











M -— » Ai(r; —d;4) 











SOME ASPECTS OF THE USE OF THE SEQUENTIAL 
PROBABILITY RATIO TEST* 


M. H. DeGroot 
Carnegie Institute of Technology 
AND 
Jack NADLER 

Bell Telephone Laboratories 


The properties of Wald’s sequential probability ratio test of a simple 
hypothesis about the mean of a normal population against a simple 
alternative are well-known. However, the applicability of the proce- 
dure is limited in practice because it requires exact knowledge of the 
population variance. The behavior of the test when an erroneous value 
is taken for the variance is investigated in two applications: (i) when 
inferences are to be made about the mean itself, and (ii) when inferences 
are to be made about the proportion of the population whose measure- 
ments exceed a given constant. Use of the test is discussed for each 
application when the variance can be restricted a priori to a given, 
finite interval. In such a situation, it is sometimes possible to apply the 
Wald test, using an assumed value for the variance. A method for 
doing this is presented which controls the risks of error and for which 
approximate bounds on the average sample size are easily found. 


1. INTRODUCTION 


ONSIDER a situation in which a sequential procedure is appropriate for 

testing certain hypotheses about the mean, u, and variance, o’, of a normal 
population. Two types of problems that are frequently encountered are: (i) 
testing that the value of u is uo against the alternative that the value of yu is 
M1 (41> mo), and (ii) testing that the proportion defective, p, is po against the 
alternative that p is p: (pi1>po), where the proportion defective is defined as 
the proportion of the population whose measurements exceed a certain given 
constant. The essential difference between these two problems is that in (i) we 
are interested in the characteristics of any applicable procedure as functions of 
u, Whereas in (ii) we are interested in these characteristics as functions (after 
a translation) of u/c. 

When both the mean and variance are unknown, various procedures, com- 
monly referred to as one-sided sequential t-tests, have been proposed for the 
second problem, e.g., the WAGR test and approximations to it, [2], [4], [6], 
[8], and [10]. To our knowledge, however, only one author, Baker [1], has 
treated the first problem in this case. 

If the variance is known, the hypotheses of (ii) can be written in the same 
form as the hypotheses of (i). Consequently, the Wald sequential probability 
ratio test (SPRT) on the mean of a normal distribution, described in Section 
2, is appropriate for both problems. The desirability of the Wald SPRT in this 
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context results not only from its optimal properties, but also from the facts 
that its operating characteristic (OC) and average sample number (ASN) 
functions are known to within reasonable approximation and the test itself is 
easily carried out. Since the other procedures mentioned above do not enjoy 
all these advantages, one may be tempted to use the SPRT in situations where, 
although the variance is in fact unknown, a “reasonable” guess of its value can 
be made from available information. 

In Sections 3 and 4 the sensitivity of the SPRT procedure to departures of 
the variance from its assumed value is studied for the two problems. Pro- 
cedures using the SPRT are proposed in Sections 5 and € for each problem when 
the variance can be restricted a priori to a known, finite interval. 

The study is based in part on the usual Wald approximations to the OC and 
ASN functions, given in Section 2. There has been some numerical investigation 
of the quality of these approximations, e.g. [1], [7], and bounds on their error 
have been derived by Wald ir the Appendix of [9]. The fact that the theory is 
exact for the analogous problem with a continuous time parameter, [3], [5], 
gives added support to the expectation that the behavior of the approximating 
functions is not unreasonable. 


2. THE SPRT pRocEDURE 


Let the random variable X have a normal distribution with mean u and posi- 
tive variance o*, and let o,4? be a given positive constant. Suppose the SPRT 
procedure assuming o?=¢,? is used to test the hypotheses 


bes = Ho 


(1) 
Ay: w= ph, wh > po. 


Denote by 2; the value of the 7th observation in a sequential random sample 
of observations on the random variable X. Let f(z, u, 0?) denote the ordinate 
at the value x of the normal density function with mean yu and variance o’. 
To perform the test, observations are taken sequentially as long as 

b< > Neg re fe SA) <a (2) 
i=l Sli; Ho, oa”) 
where a and b are two constants with b<0<a. The procedure is terminated 
with acceptance of H, if the left-hand inequality is broken first, or with accept- 
ance of H, if the right-hand inequality is broken first. 

Let a be the probability of rejecting H» when it is true, and let 6 be the proba- 
bility of rejecting H, when it is true. Thus, a and @ are the error risks of first 
and second kind. When the population variance o? is in fact equal to o4? then 
a and 6 satisfy 


’ < log ((1 — 8)/a) 3) 
b > log (8/(1 — a)). 
The inequalities (3) imply that 
ik (4) 
B<e. 
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The approximate Wald OC and ASN functions P(u) and N(u, o4?) are 
respectively, 











ab come (5) 
, ae ea — ebn 
and 
F 204 (a —-b)P(u) — a 
N (u, v4”) -- = 2 (6) 

(41 — po)? ” 

where 
+ po — 2 
2 ee (7) 


Mi — Mo 


When u = (uo+u1)/2 the above expressions are indeterminate, but their limiting 
values are easily found to be 





+ a 
p(® —*) ih eee (8) 
2 a-—b 
and 
0 + cate b 
N (“. =e c.*) a a ~ A (9) 
2 a Ho)? 


Note that (5) yields approximations for the error risks a and §; i.e., 
aiYtli- P(uo) ; 
{ (10) 


B = P(u). 
A complete description of this material is given by Wald in Chapter 7 of [9]. 


3. THE SENSITIVITY OF THE TEST TO DEVIATIONS OF o”* FROM o,” 


If o?o,4*, this procedure will not be a SPRT, since f(x; u;, o4) is no longer 
the density function of X under the hypothesis H; (¢=0, 1). Nevertheless, it is 
an easy matter to derive approximations to the OC and ASN functions of the 
test for any particular value of o?. Set \=o.4/o and note that (2) holds if and 
only if 








n Cy ’ 2 
Mb <a? Slog AMD & yg (1) 
i=l F (xi; bo,*o 4”) 
It is easily verified that 
- (ij May OA”, : ij; May 
> log = oo? = > log Assi a (12) 
i=l S (xi; wo, 74?) i=l f(s; wo, 0”) 


so that (11) may be written in the convenient form 
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bt < > log f(x; M1, o*) 


, 13 
i=l f(x; Ho, o*) P On 


where b* =\*b and a* =)*a. 

The left-hand (right-hand) inequality of (13) is broken if and only if the 
left-hand (right-hand) inequality of (2) is broken. Hence, when o?#¢,’, the 
test procedure is equivalent to the SPRT indicated by (13) and it follows that 
the risks, say a* and §*, satisfy 


r < log ((1 — 6*)/a*) 


(14) 
b* > log (8*/(1 — a*)), 


a* < ee = e-on” 
{ (15) 


pt se =e. 


Also, by this equivalence, the approximate Wald OC and ASN functions for 
the SPRT serve as approximate OC and ASN funciions for the given test 
procedure. 

These functions will be denoted by P(u, o*; 042), and N(u, o*; o4*), respec- 
tively, where, for simplicity, the arguments a, b, wo, and yw; are suppressed. 
Reference to equations (5)-(9) shows that for each fixed value of o* and 
uw (uot+n1)/2, 


evr — ] ens — 1 


eo — obs “f er" — ebd*n 





P(u, 0? ¢4*) = (16) 


20? (a* — b*)P(u, 07; ¢47) — a® 
(ui — mo)? ” 
2c,4? (a — b)P(u, 07; 047) —a 








(41 — po)? 


Furthermore, 








—a*b* — ‘ab 
y(@=*, of: o1') " a * (19) 
2 (41 — po)? (m1 — po)? 


These results were given by Baker [1] in a slightly different context. 
1—P(uo, 0°; 74") and P(u, 0”; a4”) will be called the approximate risks, while 
1—P(uo) and P(u:) of (10), which in the present notation might also be 
written 1—P(yo, 47; 74”) and P(y, 747; 74”), will be called the design risks. 
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It is now evident that for a fixed value of u¥(uot+m:1)/2, P(u, 0, o4?) is a 
strictly increasing (decreasing) function of \ if 7 >0 (7 <0). Consequently, both 
approximate risks are less (more) than the corresponding design risks whenever 
A>1 A<1). If w=(wotm1)/2 (i.e., 7» =0) P(u, o?; v4?) is independent of X. In 
addition, N(u, o?; 04?) is always a strictly decreasing function of o°. 

To give an idea of the magnitude of the effects which may occur when 
o*#c,*, the approximate risks and the ratio of the approximate ASN, 
N(u, 07; v4”), to the design ASN, N(u, 04”), at the hypothesis points are given 
in Table 191 for some values of \ when the design risks are both equal to 0.05, 
and both equal to 0.10.These results seem to indicate that the SPRT of the 
mean of a normal population is quite sensitive to moderate relative departures 
of the variance from its assumed value. 


TABLE 191 


APPROXIMATE RISKS AND RELATIVE CHANGE IN THE ASN 
AT THE HYPOTHESIS POINTS, WHEN o*#c.? 








1—P(uo, 0%; cat) = | N (uo, 0°; 04?) ul N(o1, 0; 74°) 
P(ui, 0; oa) N (wo, o.4?) N(u, o4?) 





A=e4/oe 





.160 .76 
.070 .96 
.050 .00 
.035 .03 
.010 .09 





.225 .69 
.127 .93 
.100 .00 
.077 .06 
.031 DY i 











ojala” Som eimae om 





The same problem arises, of course, in the fixed sample test on the mean of 
a normal distribution, when o” is assumed known. It is worthwhile to note that 
the actual risks of the fixed sample procedure, resulting from departures of o? 
from its assumed value, behave similarly to the approximate risks in Table 191. 
The values of the actual risks are easily found for any given value of \. For 
example, for design risks both equal to 0.05 the actual risks turn out to be 0.11 
when \ = 3/4 and 0.02 when \=5/4. 


4. THE PROPORTION DEFECTIVE PROBLEM 


In a proportion defective problem it is desired to make inferences about the 
proportion of the population whose measurements lie above a certain given 
constant, say U. For the population associated with the normal random vari- 
able X, the proportion defective, p, is defined by 


og 1 
a. f ewes, (20) 
vu V2x0* 
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To each value of p, there corresponds 4 unit normal deviate, K, which is defined 
by 
co) l . 
p= f == @~* /*gy, (21) 
Kk V20 


An easy consequence of this definition is that 


Suppose one wishes to test the hypotheses 
” o: P= Po 
Hi: p=, Pi > Po 


or, equivalently, 


fo K = Ko 
A: K = K,, K, < Ko. 
If the population variance is known to be c,4?, the SPRT of the mean of a 


normal population, with limits a and b, may be used. The hypotheses to be 
tested become 
Hy: 4 = = U — Kooa d 
: (25) 
Ay: p=m = U — Kia, 
using the relation (22). 

If the variance is assumed to have the value c4? when in fact o?¢,?, this 
procedure again will not be a SPRT. However, as in Section 3, the test pro- 
cedure is equivalent to a SPRT with limits a*=a)? and b* =b\* though the 
discrepancy between o? and co,’ now further manifests itself in that the corre- 
spondence (22) between uw and K used to obtain the hypothesis points, uo and 
#1, is based on an erroneous value of the variance. 

Suppose then that o*?#¢,?. Using relation (22), the expression (7) for » can 
be rewritten as 

(U — Koos) + (U — Kyou) — 2(U — Ko) 
(U — Kyo4) -_ (U —_ Koo) 
2Ke ae (Ko + Ki)oa 
(Ko bee’ Kio 
2Kyx — (Ko _ K,) 
Ky — Ky 





9 = 








It now follows that substitution into (16) and (17) gives the approximate OC 
and ASN functions of the test procedure, written as functions! of K: 


ev (K Ada = } 





P(K, o) = (27) 


, 
ev (KA)a pare eo (KA)b 





1 This result had been obtained previously, although without the knowledge of the authors, by W. H. Kruskal. 
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where 

2K — (Ko + Ki)? 
, 


K,\) = >t = 
g(K,) = 9 (kK. — K) 





2d (a — b)P(K,\) —a 
Ko = K, 2K = (Ko + K)) 
Again, at the indeterminate value K = K,=(K,+K;,)\/2 we have 


N(K,) = 





P(K,, \) = 


a-—b 


—d‘ab 
N(K,, d) = . (31) 
o4*(Ko Pe K,)? 

It should be noted that both of the functions (27) and (29) depend upon oy, 
and o only through their ratio, \ . By analogy with the discussion of Section 3, 
we call 1—P(Ko, 1) and P(K,, 1) the design risks of the first and second kind. 
and 1—P(Ko, \) and P(K,, \) the approximate risks of the first and second 
kind. 

We now investigate the qualitative behavior of P(K, \) in some detail, using 
the fact that P(K, X) is a strictly increasing function of g(K, \), and that its 
behavior will depend upon the sign of Ko+K;,: 

(i) Ko+K,=0. In this case, g(K, \) =AK/Ko. Hence, for a fixed value of 
K #0, P(K, X) is a strictly increasing (decreasing) function of \, K>0(K <0). 
It therefore follows that the approximate risks are both greater (less) than the 
design risks, whenever \> 1(A <1). Furthermore, N(K, \) is a strictly increasing 
function of \ for all K. 

(ii) Ko+K,>0. When Ko+K,+0, g(K, X) is a parabolic function of \ for 
each given value of K. Examining the specific functional form it can be seen 
that in this case the approximate risk of the first kind is less than or equal to 
the corresponding design risk if and only if \ is between the values 1 and 
(Ko— K,)/(Ko+K;,). In addition, the approximate risk of the second kind is 
smaller than the corresponding design risk if and only if \>1. The behavior 
of the function N(K, \) is more complex, since the values of \ for which N(K, \) 
increases depend upon the magnitudes of the design risks as well as Ky and K,. 
Numerical examples are given in Table 194. 

(iii) Ko +K,<0. This case can be studied by analogy to (ii). Here, the pa- 
rabolas, g(Ko, \) and g(K,, dA), are convex rather than concave. It follows that 
the approximate risk of the first kind is less than the corresponding design 
risk if and only if \>1 and that the approximate risk of the second kind is less 
than the corresponding design risk if and only if \ is between the values 1 and 
(Ki—Ko)/(Kot+)). 

To give an idea of the magnitude of the effects which may occur when 
o?o.42, the approximate risks and the ratio of N(K, \) to N(K, 1) at the 
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hypothesis points are given in Table 194 for two examples in which Kyo+K,>0. 
K,>0. It is easily seen from equations (27)—(29) that, at the hypothesis points, 
P(K, \) and N(K, \)/N(K, 1) depend upon Ky and K, only through their 
ratio, Ko/K,. Values are tabulated for Ko/K;=1.529 with design risks a=8 
=0.05 and a=8=0.10. 

It is evident from Table 194 that the approximate risks are extremely 
sensitive to departures of o from o,?, at least for certain values of Ko and K,. 
It may also be instructive to compare Table 191 with Table 194. The effect 
of assuming an incorrect value of o? appears to be much larger in this problem 
than in the test on the mean alone. The monotonicity of the approximate risks 
exhibited in Table 194 is due to the fact that the tabulated values of \ do not 
cover the point at which the maximum of g(K, \) occurs. 


TABLE 194 


APPROXIMATE RISKS AND RELATIVE CHANGE IN THE ASN AT 
THE HYPOTHESIS POINTS WHEN o?#e4? AND K,/K,=1.529 








1—P(Ko, 1) =P(Ki, 1) =0.05 1—P(Ko, 1) =P(Ki, 1) =0.10 





N(Ko, 2) N(K, ») N(Ke, ) N(Ki, ») 
—_——— P(K,, ») ag P(K,, )}——_— 
N(Ko, 1) N(R, 1)| P(Ko N(R, 1) 


l— 
P(Ko, d) 











3/4 .008 .37 ; 91 .026 . .580 
7/8 .016 59 ‘ -15 .044 " .316 
15/16 .027 -76 ; ll -064 , .191 
31/32 .036 .87 ‘ ’ -080 
1 .050 -00 ‘ j .100 
33/32 .070 -16 ‘ ‘ .127 
17/16 .100 35 : ‘ 163 
9/8 -209 81 * é .270 
5/4 .672 44 . < .631 





























5. THE USE OF A PRIORI KNOWLEDGE ABOUT THE VARIANCE IN THE 
TEST ON THE MEAN 


Consider the problem of testing sequentially the hypotheses 


oe = Mo 


(32) 
Ay: p= mM 


when the variance o? is unknown, but is known to lie in a finite interval, 
0<¢1?<o*<eyp’. It is required that the actual error risks of the procedure to 
be used do not exceed two given positive constants a» and Bo, ao+8o<1. 

Consider a fixed number ¢,? to be substituted into the Wald SPRT in place 
of the unknown o*. We now ask if there exist numbers a>0 and b<0, to be used 
as sequential limits for the test statistics. 


D log [f2s; m, o4°)/f(as; wo, o42)], (33) 





SEQUENTIAL PROBABILITY RATIO TEST 195 


such that the resulting error risks satisfy the conditions given above. The fol- 
lowing theorem demonstrates the existence of such limits and exhibits a reason- 
able choice of their values. 

Theorem 1. Corresponding to a given value o4?, there exist numbers a>0 and 
b<0 such that when used as sequential limits for the statistics (33), the result- 
ing error risks of the first and second kind do not exceed ap and o, respectively. 

Proof: Since o1?<o*?<ap*, then \*=¢,?/c? must lie in the interval o42/cy? 
<\ <o,?/o,?. For any given choice of a and b, the error risks of the resulting 
test procedure depend upon X. The risk requirements impose on a and b the 
conditions 


heyy [error risk of the first kind] < ao, (34) 


Max,: [error risk of the second kind] < Bo. 
From (15) it is seen that these conditions will be satisfied if 


— [e-*] < ao, 
Max: [e*] < Bo. 


Thus, the risk requirements are met for any choice of a and b such that 


Fes < a, 


ee ley) < Bo. 
Among all possible values for a and 6 satisfying (36), it is clear that 


. = (¢v?/o4*) log (1/a0) 


(37) 
b(¢4*) = (ev*/oa*) log Bo. 


is optimal, since any other choice (a, b) will bracket (37), and therefore lead to 
higher ASN values. 

It is interesting and reassuring to observe that the approximate OC and 
ASN functions (16) and (17), computed for the procedure with limits (37), are 
free of 4”. In other words, the approximate characteristics of the recommended 
test procedure do not depend on the initial choice of o4?. Other advantages of 
this procedure lie in the facts that the error risks never exceed ao and 8», and 
that approximate bounds for the ASN function are easily found for any a» and 
Bo. In addition, if the interval o,?<o*?<ay’ is small this procedure may yield 
savings in the ASN for the possible values of o? over alternative tests that do 
not utilize the a priori knowledge. The consideration that the test is easily 
carried out also supports the use of the above procedure. 


6. THE USE OF A PRIORI KNOWLEDGE ABOUT THE VARIANCE IN THE 
PROPORTION DEFECTIVE PROBLEM 


In this section the problem considered is that of testing the hypotheses 


4g K = Ky 


(38) 
Hy: K = Ky, K, < Ko 
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when the variance, although unknown, is known to lie in a finite interval, 
0<o,2<0?<oy*. (Note that the value o,?=0 is excluded.) It is required that 
the actual error risks do not exceed two given positive constants, a» and Bo, 
ao+B8o<1. Again, it seems reasonable to utilize the information about the 
variance by considering SPRT procedures. 

Consider then a fixed number o4?, to be used throughout the test in place of 
the unknown o*. From (25), the test statistics become 


y log [f(2xe; U — Kyoa)/f(ai; U — Koos, oa”) | (39) 
t=1 
and we ask if there exist numbers a>0 and b <0, to be used as sequential limits, 
such that the resulting error risks satisfy the specified conditions. 
For a given choice of o4?, a, and b, let the error risks of the first and second 
kind be a* and @*, respectively. Upper bounds for a* and §* are easily obtained, 
once it has been noted that 


g( Ko, A) p log [f(ai; U — Kioa, oa?)/f(ai; U — Kooa, oa*) | 


t=1 


D log [f(xs; Ci, o*)/f(ai4; U — Koo, 0°) |, 


tml 


—g(K1, d) >> log [f(ai; U — Kies, 042) /f(ai; U — Koos, 04) 


i=l 


= > log (f(z; U — Kyo, o*)/f(x;i; Co, o°) |, 


t=1 


where Cy and C, are constants whose precise values are immaterial. 

For a given value of o, suppose g(Ko, \)>0. By analogy to Section 3, 
the given test procedure is equivalent to the SPRT of the hypotheses 
Ho: «= U—Koo vs. Hi: 1=C; that uses (40) with limits ag(Ko, \) and bg( Ko, \). 
It follows that a* is equal to the error risk of the first kind for this SPRT. As a 
consequence of (4), 


a®* < e-e(Ke, »), (42) 


On the other hand, if g( Ko, 4) <0, (42) holds, since then e~*#**. >1. A similar 
argument using (41) and (4) shows that 


p* < e700 (Ki, A) | (43) 


Thus, we have 

Theorem 2. If a* and §* are the error risks of the first and second kind for 
the sequential test procedure using (39) with limits a and b, then a* <e—*(Ko) 
and B* <e~teKi. »), 

Furthermore, \ must lie between o4/oeyv and o4/o1, since o,2<0?<oy*. The 
risk requirements, therefore, will be satisfied if we can find values of o4, a, and 
b, such that e~*“**. Cao and e~*e(X1.* <8, for all possible values of \. These 
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conditions are clearly equivalent to Min, g(Ko, \)> (i/a) log (1/ao) and 
Max, g(Ki, A)<—(1/b) log Bo. If for a given o,4, there exist values a and b 
satisfying these inequalities then there exists a pair (a, b) for which equality 
holds. This choice is optimal, since any other pair satisfying the inequalities 
will bracket the given pair and thus lead to higher ASN values. Hence we 
need to choose o,4, a, and b so that 


i g(Ko, X) = (1/a) log (1/a0), (44) 


Max, g(Ki, 4) = — (1/b) log Bo. 


But if 


Min, g(Ko, A) > 0, 
{ » g( Ko (45) 


Max, g(&i, A) < 0, 


then there always exist constants a>0 and b<0 satisfying (44). The problem 
is thus reduced to an investigation of the existence of a value o4 satisfying (45). 
This depends oniy on the functional form of g(K, ), which has been partially 
investigated in Section 4. Indeed, referring to (28), the following theorem is 
easily verified, case by case: 

Theorem 8. Let o12<o0?<ov? and let \=o4/c. 

(i) If Ko+K,=0, any value o,>0 satisfies (45). 

(ii) If Ko+K,>0 and Ky>O>K,, any value of o4 such that 0<04 <(2Ky 
/(Ko+K)))orx satisfies (45). 

(iii) If Ko +K,<0 and Ky>0>K,, any value of a4 such that 0<04< (2K, 
/(Kot+K)))ox satisfies (45). 

(iv) If Ko>K.i>0O, any value of o,4 such that (2K,/(Ko+K:))ou <o, 

<(2Ko/(Ko+K;))ex satisfies (45). There exist such values of o4 if and only if 
ov/o.<Ko/K,. If any such values of co, do exist, then o4=2eyver/(ov+or1) 
satisfies the inequalities. 

(v) If 0O>Ko>K,, any value of o4 such that (2Ko/(Kot+K,))ou <4 

<(2K,(Ko+,))ex satisfies (45). There exist such values of o4 if and only if 

ov/o.<K,/Ko. If any such values of o,4 do exist, then o4=2eye,/(ov+e_,) 
satisfies the inequalities. 

The situation most often encountered in practice is that in which Ky> K,>0. 
We now summarize the ideas contained in Theorems 2 and 3 for this case. 

Accordingly, suppose we wish to test the hypotheses (38), with Ko>K,>0, 
when the unknown population variance can be restricted to an interval 
a2 <0? <oy? such that cy/o,<Ko/K;,. It is required that the actual error risks 
of the first and second kind do not exceed a» and 8, respectively. Part (iv) of 
Theorem 3 states that there exists a sequential test procedure, based on the 
statistics (39) with o4=2eve,/(ov-+e_), and test limits a and b, that will fulfill 
the risk requirements. The limits a and b are foun by solving equations (44). 
From (28), and a sketch of g(Ko, d), it is seen that the minimum value of 
g( Ko, \) occurs at the largest value of A, namely \ = ¢4/o1. Similarly, the maxi- 
mum value of g(K:, \) occurs at \=o4/ov. Thus, letting g=g(Ko, ¢4/o1) and 
gi=g(Ki, o4/ov), equations (44) state that 
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{’ = (1/go) log (1/a) 
b = (1/g:) log (1/Bo) 


Thus, the test procedure based on (39), with limits (46), satisfies the given 
error risk conditions. 


(46) 


7. NUMERICAL EXAMPLES 


We now give two numerical examples illustrating the use of the procedure 
discussed at the end of Section 6. Approximate OC and ASN values for the 
recommended test procedure are computed from (27) and (29). 

Example 1. Ko= 1.960, K,= 1.282, Ko/Ki=1.529, ov/o1=1.25, ao =8o=0.05. 
Using o4=2e10u/(¢1+ev), we have o4/ov=0.889, o4/o0,=1.111. Solving for 
a and b from (46) gives a=5.74, b=7.20. Approximate OC and ASN values 
when A\=1, A\=e4/ev, and \=e,4/e1, for the test procedure using the above 
values for o4, a, and b are given in Table 198. 











TABLE 198 
APPROXIMATE OC AND ASN VALUES FOR EXAMPLE 1 
»r P(Ko, dr) P(Ki,, d) N(Ko, d) N(K,, ») 
1.111 .951 .000 68 18 
1.000 .997 .001 31 25 
0.889 1.000 .046 18 42 

















An empirical study of the WAGR test carried out at the Statistical Rese rch 
Center, University of Chicago, gave, for this case, ASN values of about 35 and 
30 at Ky and K,, respectively. Comparing these values with those given in 
Table 198, it is seen that the procedure presented here yields smaller ASN 
values in the neighborhood of \= 1, but yields higher values at both Ky and K, 
for extreme \. It should be noted however that ASN values quoted for the 
WAGER test, estimated from samples of 500 observations, contain only a small 
sampling error, whereas the ASN values given in Table 198 depend upon the 
validity of the Wald approximations. On the other hand, the test presented here 
insures that the stipulated error risks are not exceeded, whereas the WAGR 
test fulfills the risk requirements only approximately. 

The next example treats the same case as Example 1, except that o? is now 
restricted to a smaller interval. 

Example 2. Ky= 1.960, RK; = 1.282, K,/Ki= 1.529, ou/eL= 4.3, ao = Bo = 0.05. 
Using o4=2eve1/(ev+er), we have o4/ov =0.952, o4/0,=1.048. Solving for 
a and b gives a=3.71, b= —4.10. The approximate OC and ASN values for 
this test are given in Table 199. 

These values compare favorably with those given for the WAGR test in 
Example 1. Thus, if ¢ is restricted to an interval such that ov/o,<1.1, the 
recommended procedure appears to be uniformly better, but again the com- 
ment made in Example 1 is appropriate. 
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TABLE 199 
APPROXIMATE OC AND ASN VALUES FOR EXAMPLE 2 














d | P(Ke, ») P(K;,») |} N(Ke, ») N(K,, ») 
1.048 952 005 22 14 
1.000 .976 .016 17 16 
0.952 .987 .047 ‘ 13 17 
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KENNETH JOSEPH ARROW is professor of Economies and Statistics at Stanford 
University, a rank he attained in 1953 at the age of 32. He received his bachelor’s 
degree in social science from City College in 1940, his master’s in mathematics in 
1941, and his doctor’s in economics in 1951, at Columbia. During the Second World 
War he did meteorological research as a Captain in the Air Force, and then was a mem- 
ber of the Department of Economies at the University of Chicago from 1947 until 
going to Stanford in 1949. In 1956 he was President of the Econometric Society, and 
in 1957 he received the John Bates Clark medal of the American Economic Associa- 
tion. His best-known publication is his book Secial Choice and Individual Values, 
(Wiley, 1951). The present article is his first in the JASA, but he has published a 
number of well-known articles in other journals. Currently his main interests are 
economic theory, programming, theory of the planned economy, and behavior under 
uncertainty. The present review article grew out of a request for an ordinary review. 
He was especially interested in Tinbergen’s book because he “had visited the Dutch 
Central Planning Office in 1952 and been impressed with the high quality of the 
thinking there, as contrasted with, e.g., the corresponding British outfits.” 


JOSEPH BERKSON, 58, is Head of Biometry and Medical Statistics at Mayo 
Clinie and Professor of Biometry at the University of Minnesota. He received his 
B.S. in engineering at the College of the City of New York in 1920, his M.A. in physics 
at Columbia in 1922, his M.D. at Johns Hopkins in 1927, and his D.Sc. in statistics 
there in 1928. He remained until 1930 at the Institute for Biological Research, work- 
ing with Dr. Raymond Pearl, whose Introduction to Medical Biometry and Statistics 
he reviewed for the JASA in 1942, pp. 145-7. He has made twenty other con- 
tributions to the Journal, but the present article is his first since 1955. He was with 
the U. 8. Public Health Service in 1931 as a surgeon; since 1931 he has been at the 
Mayo Clinic and since 1949 he has been connected with the University of Minnesota. 
From 1942 to 1945 he was a Colonel in the Medical Corps in the Office of the Air 
Surgeon, Army Air Forces, as Chief of the Division of Medical Statistics. He is a 
Fellow of the American Association for the Advancement of Science, the American 
Public Health Association, the American Statistical Association, the Institute of 
Mathematical Statistics, and the Royal Statistical Society of London. He is a mem- 
ber of the International Statistical Institute, the Expert Advisory Panel on Health 
Statisties of the United Nations, and of Delta Omega and Sigma Xi. He received the 
Legion of Merit of the War Department in 1946. Numerous articles by Dr. Berkson 
have appeared in other scientific journals. 


ZYGMUNT WILLIAM BIRNBAUM, 54, is Professor of Mathematics and 
Director of the Laboratory of Statistical Research at the University of Washington. 
He received the degree of Master of Law in 1925 at the University of Lwow, Poland 
and the Ph.D. in mathematics there in 1929. The next two years were spent in re- 
search at the University of Goettingen, Germany. From 1931 to 1936 he was an 
actuary in Austria and Poland, and from 1937 to 1939 he had a research appoint- 
ment at New York University. Since then he has been at the University of Washing- 
ton except when he was visiting professor at Stanford in 1951-52. He is a Fellow of 
the American Statistical Association and of the Institute of Mathematical Statistics, 
and since 1955 has been an Associate Editor of the Annals of Mathematical Statistics. 
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He has published numerous papers on conformal mapping, inequalities, probability, 
and theoretical and applied statistics. His interest in statistical problems ranges from 
pure mathematical theory to engineering application of statistics. His “Bias due to 
non-availability in sampling surveys” appeared in the JASA in Mar. 1950, and his 
“Numerical tabulation of the distribution of Kolmogorov’s statistic for finite sample 
size” in Sept. 1952. About his present article he says: “My interest in problems of life- 
length of materials, in. particular of fatigue-life of metals, dates back almost ten years 
when, in the course of some consultations, I came in touch with these problems. From 
that time on I have been thinking of applying the methods and concepts of stochastic 
processes to this field, and the present paper is the first of a sequence of publications 
planned under this program.” 


HUBERT MORSE BLALOCK, JR., Assistant Professor in the Department of 
Sociology of the University of Michigan, is 31 years old. He received his A.B. at 
Dartmouth in 1949, his M.A. at Brown in 1953, and his Ph.D. in sociology (minor in 
statistics) at the University of North Carolina in 1954, since when he has been at the 
University of Michigan. This is his first appearance in the J ASA, but in the American 
Sociological Review he has published “Economic discrimination and Negro increase” 
(October 1956) and “Percent non-white and discrimination in the South” (December 
1957). His main fields of interest, as these titles suggest, are race relations, sociological 
theory, and, of course, applied statistics. 


DUDLEY JOHNSTONE COWDEN is Professor of Economie Statistics in the 
School of Business Administration of the University of North Carolina, at present at 
the University of London as Honorary Research Associate, and is 59 years old. He 
received his A.B. in history at Grinnell in 1919, his M.A. in business at Chicago in 
1922, and his Ph.D. in economics at Columbia in 1931. He taught economics at 
Lafayette College, 1926-29, at St. John’s University (Brooklyn), 1931-34, and at 
Williams College 1934-35, since when he has been at North Carolina. He is best 
known for his two texts with F. E. Croxton, Applied General Statistics (Prentice-Hall, 
1939, 1955) and Practical Business Statistics (Prentice-Hall, 1934, 1948). Last year 
Prentice-Hall brought out his Statistical Methods in Quality Control (reviewed in 
the Dee. 1957 J ASA). His previous publications in the J ASA are: “Moving Seasonal 
Indexes” (Dec. 1942), “Correlation Concepts and the Doolittle Method” (Sept. 1943), 
“An Application of Sequential Sampling to Testing Students” (Dec. 1946), “Simpli- 
fied Methods of Fitting Certain Types of Growth Curves” (Dec. 1946), and “The 
Multiple-Partial Correlation Coefficient” (Sept. 1952). He is especially interested in 
business and economic statistics; quality control, particularly the economics of qual- 
ity control; and statistical theory, particularly regression and correlation. His article 
in this issue results from having noticed early in his teaching career that many text- 
books describe methods of deriving multiple and partial correlation coefficients from 
zero-order coefficients, but none treat regression coefficients similarly. He noted, 
however, that the back solution for the Doolittle method does make use of regression 
coefficients of the same order as, and lower than, the one being computed. The present 
article started as a short note submitted to The American Statistician, whose Editor 
thought it more appropriate for the J ASA, whose Editor in turn suggested expansion 
to include mathematical derivations and something on tests significance. 


MORRIS HERMAN DEGROOT is Assistant Professor in the Department of 
Mathematics at Carnegie Institute of Technology. He received his bachelor’s at 
Roosevelt .University in mathematics in 1952, his master’s at the University of 
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Chicago in statistics in 1954, and at 26 will be receiving his Ph.D. in statistics at 
Chicago in the same month that this issue appears. In The American Journal of 
Human Genetics he has published “Efficiency of Gene Frequency Estimates for the 
ABO System” (1956, pp. 39-43) and “The Covariance Structure of Maximum Likeli- 
hood Gene Frequency Estimates for the MNS System” (1956, 229-35). He is inter- 
ested in both theoretical and applied statistics. The present article, his first in the 
J ASA, is based on work with Jack Nadler during the summer of 1956 at the Statis- 
tical Research Center of the University of Chicago. 


RICHARD JAY FOOTE, 43, recently left the Department of Agriculture where 
he had worked in the commodity price and demand forecasting field for 12 years and 
is now Director of Commodity Research for the Connell Rice and Commission 
Company, Inc., 19 River Street, Stamford, Connecticut. He received a B.S. in 
agricultural economics from Michigan State in 1935 and an M.S. from Iowa State in 
1937. Besides government handbooks and bulletins, he has published Agricultural 
Prices (co-author) (McGraw-Hill, Ist ed. 1936, 2nd 1952), “A comparison of single 
and simultaneous equation techniques” in Journal of Farm Economics (1955), and 
“How research results can be used to analyze alternative governmental policies” 
(co-author) in Agricultural Economics Research (1956). In the JASA he published 
“The mathematical basis for the Bean method of graphic multiple correlation” in 
Dec. 1953. He is an econometrician with primary interests in applications of systems 
of simultaneous equations for analytical and forecasting purposes in relation to 
agricultural commodities. Of his present article he states: “The basic research on 
which the present article was based was an attempt to state Cowles Commission 
literature with respect to systems of simultaneous equations in terms that might be 
understandable to most applied research workers. We decided to give a method for 
ordinary least squares analysis that would be similar to the methods used for systems 
of equations. The result was a method that has many advantages over those com- 
monly in use. Actually experts in the field, such as Dwyer, have already recom- 
mended these methods.” 


BERT GEORGE HICKMAN, age 33, is Research Associate at the Brookings 
Institution, 722 Jackson Place, N.W., Washington, D. C. He received his Ph.D. at 
Berkeley in 1951. His research and teaching positions have been at Stanford, 1949- 
51, the National Bureau of Economic Research, 1951-52, Northwestern, 1952--54, 
President’s Council of Economic Advisers 1954-56, and the Brookings Institution 
since 1956. This is his first journal article, except for one simultaneously for the 
Feb. 1958 issue of The Review of Economics and Statistics. The National Bureau has 
published papers by him on “The Korean War and United States Economic Activity, 
1950-1952” and “Capacity, Capacity Utilization, and the Acceleration Principle,” 
and the Joint Economic Committee published his “Federal Spending and the Stability 
of the Postwar Economy.” His main interests are business cycles, income and em- 
ployment theory, and monetary theory. Asked about the origin of the present paper, 
he wrote: “The experiment in weighting was undertaken because I wanted to use 
diffusion indexes in comparative studies of business cycles, and felt that I could not 
do so with confidence until I had satisfied myself that the neglect of differences in the 
relative importance or amplitudes of individual activities did not unduly affect the 
behavior of the available unweighted indexes. Incidentally, I make analytical use of 
diffusion indexes in a paper on ‘Postwar Cyclical Experience and Economic Stability’ 
which was presented at the December meeting of the AEA and will be published in 
the May 1958 issue of the Papers and Proceedings. This last paper includes a test of 
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the hypothesis (advanced in the JASA paper) that differences in diffusion would be 
found among industries grouped according to rate of growth over intervals spanning 
several business cycles.” 


ROBERT HOOKE is Manager of the Statistics Section of the Mathematics De- 
partment, Westinghouse Research Laboratories, Beulah Road, Churchill Boro., 
Pittsburgh 35. He is 39 years old, and received his bachelor’s and master’s degrees 
at the University of North Carolina in 1938 and 1939. His Ph.D., like the other degrees 
in mathematics, was awarded by Princeton in 1942. He taught at North Carolina 
State 1941-46 and at the University of the South 1946-51. In 1951-52 he was a mem- 
ber of the Operations Evaluation Group in the Department of Defense, and from 
1952 until 1955 was a member of the Statistical Research Group and Analytical 
Research Group at Princeton University. He has been with Westinghouse since 1955. 
In the JASA for Sept. 1954, he published, with M. H. Belz, “Approximate distribu- 
tion of the range in the neighborhood of low percentage points.” His other papers 
are: “Finite groups and restricted lie algebras” (Sept. 1951) in the The Mathe- 
matical Journal, “Linear p-adic groups and their lie algebras” in the Annals of 
Mathematics (Oct. 1942), “Symmetric functions of a two-way array” and “Some appli- 
cations of bipolykays to the estimation of variance components” in the Annals of 
Mathematical Statistics (Mar. 1956), and “Control by automatic experimentation” in 
Chemical Engineering (June 1957). His special interests are design of experiments, 
particularly as applied to the physical sciences and to automation; and various aspects 
of mathematical statistics. “The problem treated in the present paper was suggested 
by a question from a scientist who wanted to know the ‘best’ formula for approxi- 
mating the centroid of the area under a curve that had been found experimentally. 
In the literature on numerical integration I did not find any treatment of experi- 
mental errors in the data, and this paper was the result of thought about this prob- 
lem.” 


OTTO JACOB KARST, 43, is Associate Professor of Education at the School of 
Education of New York University (31 Press Building, Washington Square, New 
York 3). He received the B.A. from Montclair State Teachers College, New Jersey, 
in 1936, and the M.S. and Ph.D. from NYU in 1944 and 1956. From 1946 to 1950 he 
was Assistant Professor of Physics and from 1950 to 1957 he was Associate Professor 
of Mathematics at Stevens Institute of Technology. He has published “The funda- 
mental theorem of calculus revisited” in the New Jersey Mathematics Teacher (Oct. 
1954) and “Analysis of the required undergraduate mathematics courses for engi- 
neering students” in School Science and Mathematics (Jan. 1955). His principal inter- 
ests are statistical inference and probability. Reading in a statistics text that it was 
not convenient to use least deviations as a basis for a line of best fit started him on 
work which resulted in the present article, his first in the JASA. 


“WILLIAM RAMSDELL LEONARD, 1957 President of the American Statistical 
Association, is Director of the Statistical Office of the United Nations, lives at 30 
Amherst Road, Port Washington, New York, and is 53 years old. He received an 
A.B. from Whitman College in economics in 1926, an A.M. in economics from Tufts 
in 1928, and a Ph.D. in economics and statistics from Cornell in 1934. From 1934 
to 1946 he was a staff member of the Central Statistical Board, which later became 
the Division of Statistical Standards of the Budget Bureau. In 1947 he became 
Deputy Director of the Statistical Office of the United Nations, later Acting Director, 
and in 1950, Director. He is co-editor of Government Statistics for Business Use (Wiley, 
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Ist ed. 1946, 2nd ed. 1956) and contributor on the history of statistics to the Diction- 
ary of American Biography, the Encyclopaedia of the Social Sciences, and the Encyclo- 
paedia Britannica. His chief interest is in the development and execution of statistical — 
programs. His only previous article in the JASA was “Opportunities for statistical 
work in the Federal Government” (Mar. 1945). 


MILTON DOUGLAS LIEBERMAN, 43, Planning Assistant for the 1960 Census 
of Housing, United States Bureau of the Census, lives at 6505 Laverock Lane in 
Bethesda, Maryland. He graduated at Penn State in 1936 and did graduate work at 
George Washington, American, and Georgetown Universities. Three years’ teaching 
in Philadelphia started his career; since then he has been with the Bureau of the 
Census except for three years in the Army (he is now a Major in the Air Force Reserve) 
and two years as United Nations Census adviser to the Government of the Union of 
Burma 1952-54. He published “The census programme in Burma” in Civilizations, 
1955, but has had no previous publication in the JASA. When he met statisticians of 
other countries at United Nations conferences, he found “great interest shown in the 
Philippine statistical program” and he feels that his description of the Philippine 
program will be of help to those working in other countries. 


JACK NADLER, 28, received his A.B. from Brown in 1951 and his A.M. from 
Chicago in 1954, both in economics. Currently he is a Ph.D. candidate in the Depart- 
ment of Statistics at Chicago. He joined the technical staff of Bell Laboratories last 
October, and his address is Whippany Laboratory, Whippany, New Jersey. The pres- 
ent paper, his first publication, is based on work done during the summer of 1956 
when he and his co-author were both Research Assistants at the Statistical Research 
Center, University of Chicago. 


DES RAJ, 38, is at present Associate Professor of Mathematics at the American 
University of Beirut, Lebanon. He is on leave from his regular post at the Indian 
Statistical Institute, 203 B.T. Road, Calcutta 35. He received bachelor’s and master’s 
degrees in mathematics from the University of the Punjab and master’s and doctor’s 
in statistics from Caleutta University. He has taught statistics at Lucknow Uni- 
versity 1947-53, at the Indian Statistical Institute 1953-56, and at the American 
University of Beirut since 1956. He has been associated with the Indian National 
Sample Survey being conducted by the Indian Statistical Institute. He is interested 
in sample survey methods and theory, and estimation theory. In the JASA he 
published “Estimation of the parameters of type III populations from truncated 
samples” in June 1953, and “Some estimators in sampling with varying probabilities 
without replacement” in June 1956. His present paper is a follow-up of a paper he 
published in Ganita, “On sampling with probabilities proportionate to size” (1955), 
and it gives similar comparisons with other methods of sampling. He has also had 
papers in Sankhya, the Annals of Mathematical Statistics, and the Journal of the 
Indian Society of Agricultural Statistics. 


DORIS PHELPS ROTHWELL, is Chief, Section of Farm Products and Foods, 
Division of Prices and Cost of Living, Bureau of Labor Statistics, Washington 25, 
D. C. She received an A.B. from Mount Holyoke in 1926 and an M.A. in Economics 
from American University in 1944. She did statistical research in the Controller’s 
Office of R. H. Macy and Company from 1927 to 1932; was a statistician with the 
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National Park Service from 1935 to 1942; and has worked in several branches of the 
Bureau of Labor Statistics from 1942 to date. The present article, her first in the 
J ASA, “is the outgrowth of long research at the Bureau for a suitable formula for 
use with varying seasonal weights in calculating the Consumer Price Index, which 
was undertaken as part of the 1953 comprehensive review and revision of the index.” 
In the Monthly Labor Review she has published “Price control in Canada” (Aug. 
1945), “Price control since the general maximum price regulation” (Oct. 1945), and 
“Trend contrasts in commodity prices and service rates” (May 1954). She wrote the 
article on “Living costs and standards” for the 1946 New International Yearbook, and 
the following BLS bulletins: No. 879, The General Maximum Price Regulation; 966, 
Consumers’ Prices in the United States, 1942-48 ; and 1039, Interim Adjustment of Con- 
sumers’ Price Index. Her fields of interest are index number methodology and price 
analysis and measurement. 


SAM CUNDIFF SAUNDERS, a research specialist with the Boeing Airplane 
Company at Seattle, received his bachelor’s degree at the University of Oregon in 
1952 and his doctor’s in mathematical statistics at the University of Washington in 
1956. His fields of interest are statistical processes and statistical inference in applied 
analysis. At 26, this is his first publication. 


HENRY SCHEFFFE is Professor of Statistics at the University of California 
(Berkeley). He is 50 years old, received his A.B. at Wisconsin in 1931, his A.M. there 
in 1933, and his Ph.D. in 1935, all in mathematics. He also did a year of post-doctoral 
study in mathematical statistics at Princeton in 1941-42. He was on the faculty at 
Wisconsin from 1931 to 1937, at Oregon State 1937-39 and 1940-41, at Reed 1939-— 
40, at Princeton 1941-44, at Syracuse 194446, at UCLA 1946-48, at Columbia 1948- 
53, and at Berkeley since 1953. He was president of the Institute of Mathematical 
Statistics in 1954, and vice president of the ASA (of which he is a Fellow) 1953-56. 
His previous pubtications in JASA are “The relation of control charts to analysis of 
variance and chi-square tests” (Sept. 1947) and “An analysis of variance for paired 
comparisons” (Sept. 1952). He has published many well known statistics papers in 
the Journal of the Royal Statistical Society Series B, Annals of Mathematical Statis‘ics, 
Sankhya, and Biometrika. The present paper, like most of his recent papers, grew 
out of a consulting problem. 


KIRON CHANDRA SEAL at 29 is Senior Research Officer, Health Division, 
Planning Commission of the government of India, stationed at Udyog Byavan, New 
Delhi. He received the B.Sc. in mathematics at Calcutta University in 1947 and the 
M.Sc. in statistics in 1949, the A.M. in mathematical statistics at Princeton in 1953, 
and the Ph.D. in statistics at the University of North Carolina at Chapel Hill in 
1954. He worked in the Damodar Valley Corporation, Caleutta, a multipurpose river 
valley project similar to the TVA, as Statistical Officer 1954-57, and joined the Plan- 
ning Commission staff last September. The present article is his first in the JASA. 
He published “On errors of estimates in various types of double sampling procedure,” 
On certain extended cases of double sampling,” and “Approximate distribution of 
certain linear function of order statistics” in Sankhya in 1951, 1953, and 1957. “On a 
class of decision procedures for ranking means of normal populations” appeared in 
Aanals of Mathematical Statistics in 1955 and “On a characterization of gamma dis- 
tributions” in the Calcutta Statistical Association Bulletin in 1957. His special interest 
is in mathematical statistics, and the problem in the present article was first con- 
sidered in 1954 in connection with his doctoral thesis. 
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SAMUEL ANDREW STOUFFER, 57, is Professor of Sociology at Harvard. He 
has an A.B. from Morningside College, Sioux City, Iowa (1921), an A.M. from 
Harvard (1923), and a Ph.D. from the University of Chicago (1930), as well as an 
L.L.D. from Morningside (1939) and an honorary Sc.D. from Princeton (1948). He 
edited the Sac City Sun (1923-26), was instructor in statistics at the University of 
Chicago (1930-31), assistant professor of social statistics at the University of Wis- 
consin (1931-34), professor (1934-35), professor of sociology at the University of 
Chicago (1935-46) with leave beginning in 194i as Staff Director, Research Branch, 
Information and Education Division, War Department. He is author of research 
monographs and papers applying quantitative methods in sociology and co-author 
of The American Soldier. In the JASA he has published “Tests of significance in 
applying Westergaard’s method of expected cases to sociological data” (co-author) 
(Sept. 1933), “Criteria of differential mortality (co-author) (Dec. 1933), “A coefficient 
of ‘combined partial correlation’ with an example from sociological data” (Mar. 
1934), “Problems in the application of correlation to sociology” in Proceedings Supple- 
ment (Mar. 1934), and “Evaluating the effect of inadequately measured variables in 
partial correlation analysis” (June 1936). 


HARVEY MAURICE WAGNER is Assistant Professor in the Departments of 
Statistics and Industrial Engineering at Stanford. He has both bachelor’s (1953) and 
master’s (1954) degrees in statistics from Stanford. Now, at 26, he is a candidate for 
the Ph.D. in economics at Massachusetts Institute of Technology. He was a Marshall 
Scholar at Cambridge University in 1954—55, and instructor in the School of Indus- 
trial Management at MIT from 1955 to 1957. His published titles include: “An 
eclectic approach to the pure theory of consumer behavior” in Econometrica (Oct. 
1956), “ A linear programming solution to dynamic Leontief type models” in Manage- 
ment Science (Apr. 1957), “A unified treatment of bargaining theory” in Southern 
Economic Journal (Apr. 1957), “A Monte Carlo study of estimates of simultaneous 
linear structural equations” in Econometrica (Jan. 1958), “A comparison of the original 
and revised simplex methods” in Operations Research (June, 1957), “Dynamic prob- 
lems in the theory of the firm” (co-author) in Naval Research Logistics Quarterly (Mar. 
1958), “The run-off election paradox” in Labor Law Journal (Feb. 1957), and “The 
simplex method for beginners” in Operations Research (Apr. 1958), but nothing 
previously in the JASA. Management science and operations research, mathe- 
matical economics, statistics, and data processing are his interests. 


HELEN MARY WALKER at 66 has just retired as Professor of Statistics at 
Teachers College, Columbia University, where she had served since 1925. She is a 
Fellow of the American Statistical Association and was President in 1944. In 1949 
she was President of the American Educational Research Association. In the JASA 
she has published “The role of the American Statistical Association,” her presidential 
address (Mar. 1945), “Bicentary of the Normal Curve” (Mar. 1934) and “A Model 
to Aid in Teaching Partial Correlation” (co-author) (Mar. 1932). She is the author of 
four well-known books: Studies in the History of Statistical Method (Williams and 
Wilkins, 1929), Mathematics Essential for Elementary Statistics (Holt, 1934), Ele- 
mentary Statistical Methods (Holt, 1943), and Statistical Inference (co-author) (Holt, 
1953). The present article resulted from an invitation of the Program Committee 
for the 1957 meetings. Her interest in the history of statistics and the fact that she 
had studied under Karl Pearson, made her a natural choice for the assignment. 


ARNOLD ZELLNER, 31 years old, is Assistant Professor of Economics at the 
University of Washington (Seattle), where he has been since 1955. His A.B. in physics 
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was received at Harvard in 1949, and his Ph.D. in economics at Berkeley in 1957. 
In 1949-50 he was a graduate student and teaching assistant in physics at Berkeley, 
from 1951 to 1953 he was a private in the Army, and from 1953 to 1955 was a graduate 
student in economics at Berkeley. His previous articles include two on “The electro- 
phoretic properties of red blood cells” (co-author) in the Archives of Biochemistry 
and Biophysics (Jan. and Feb. 1956), “An interesting general form for a production 
function” in Econometrica (Apr. 1951), “The short-run consumption function” in 
Econometrica (Oct. 1957), “Consumption and the consumption function in the 1948— 
49 recession” in the Review of Economics and Statistics (Aug. 1957), and “International 
comparison of unemployment rates” (co-author) in the National Bureau of Economic 
Research volume, The Measurement and Behavior of Unemployment (Princeton, 1957). 
The present article had its origins in some work utilizing personal consumption ex- 
penditure and personal disposable income data which he had done previously. He had 
to decide whether or not to use some of the Department of Commerce’s provisional 
quarterly estimates along with revised estimates in testing hypotheses about the 
short-run consumption function. A brief comparison of former revised and provisional 
estimates convinced him that a systematic study of these estimates was needed. 





BOOK REVIEWS 


Introduction to Statistical Reasoning. Philip J. McCarthy. New York: McGraw-Hill Book 
Co., 1957. Pp. xiii, 402. $5.75. 


Wituiam G. Mapow, Stanford Research Institute and Stanford University 


HIs introductory book is intended for use in one-semester courses for students 
Tho expect to work in the social sciences and have little mathematical prepara- 
tion. 

The author has had at least three major objectives: (1) to teach the basic concepts 
of statistical reasoning, (2) to teach the statistical techniques used in many parts of 
the social sciences, and (3) to provide in an introductory text illustrative material 
taken from significant investigations in the social sciences. “A very brief account of 
research problems usually accompanies each illustrative example, and any student 
who chooses to read some or all of the cited references may expect to improve his 
insight into the problems of research methodology in the social sciences.” (p. vii) 

The first of the twelve chapters is an introduction. 

Chap. 2 is entitled “Components of a statistical investigation.” It consists largely 
of a discussion of the parts of a sample survey of a human population. It introduces 
many of the topics studied later in the book. 

The third, fourth, and fifth chapters deal with the distribution of a variable, and 
the location and spread of a distribution. These chapters are largely, but not entirely, 
devoted to the standard topics of descriptive statistics. Frequency and per cent dis- 
tributions, both cumulative and noncumulative, histegrams, graphs, and the com- 
putation of means, medians, modes, percentiles, ranges, and other measures are care- 
fully introduced, illustrated, and discussed. In addition, distinctions are made 
between quantitative and qualitative variables, finite and infinite populations, sam- 
ple and population, and continuous and discrete random variables. Summation nota- 
tion is introduced, and is adequately used both in this and later chapters. 

Chaps. 6 and 10, respectively, discuss simple random sampling and the elements 
of sampling design. Symbols are used less often than in the other chapters. The words 
used in sampling are introduced and explained. Specifications are stated for selecting 
samples according to methods such as simple random sampling, cluster sampling, 
stratified sampling, and systematic sampling. No attempt is made to show how to 
select or use the results of the more complicated designs that occur in practice, but 
references are given for further reading in books where these topics are discussed. 
The dangers in using nonrandom sampling designs are considered. The use of a Table 
of Random Numbers is indicated, and the distinction between sampling with and 
without replacement is made. Estimation equations are not discussed except for 
means and weighted means. 

In Chap. 7, probability is introduced as a means of analyzing experiments with 
uncertain outcomes. Essentially, no mathematics is assumed. The author explains 
the meanings of words and theorems occurring in elementary probability ; among these 
are conditional probability and the basic theorems on compound events. The need for 
obtaining models is stated here, as elsewhere in the book. (However, the model to 
which the author refers is the probability model of the outcomes of an experiment 
rather than the model of the experiment or process itself that produces the probability 
model of the outcomes.) 

Chaps. 8 and 9 deal with probability models and statistical inference. Assumptions 
are made concerning the distribution of the variables and the author considers very 
clearly and nonmathematically such matters as the probability distribution of arith- 
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metic means, convergence in probability, confidence limits, null hypotheses, alterna- 
tive hypotheses, and errors of the first and second kinds. (Only a reference is given 
to a nonmathematical work on decision theory.) The binomial and normal distribu- 
tions and their properties are discussed. So also are the expected value, the finite 
sampling correction for the variance and tests of differences between two means. 
Distribution theory for small samples and nonparametric tests are very briefly men- 
tioned. The author uses the effective technique of listing all possible outcomes and 
their probabilities in order to answer questions dealing with probabilities of compound 
events and with expected values. 

In Chap. 11, the nature and uses of the x? distribution are discussed clearly and in 
detail, both for one and two dimensional tables and two or more categories. The 
minimum size of sample for the use of the x? distribution and the correction for con- 
tinuity are considered. 

Chap. 12 is devoted to linear regression and correlation. The nature of regression 
and least squares is explained. Estimates of the variance and confidence limits for the 
coefficients, the regression line, and an additional observation are all given. The dis- 
tinction between regression and correlation is fairly well made. Rank correlation and 
multiple regression are both introduced. 

At the end of each chapter some carefully selected exercises appear. Charts and 
graphs are well prepared and used. 

The verbalization of the topics of mathematical statistics that the author selected 
for the book is excellently done except in a few cases in Chap. 12 where the non- 
mathematical statements of theorems are likely to confuse the reader. This reviewer 
does not agree that one can essentially omit decision theory from an introduction to 
statistical reasoning; he does not agree that considerations of experimental design can 
be omitted from an introduction to statistics intended for social scientists, many of 
whom are more concerned with experiments than surveys; nor does he agree that a 
discussion of nonlinear estimation equations can be omitted from a chapter on sample 
design. 

Perhaps more fundamental is the question of what should be contained in an in- 
troduction to statistical reasoning written for social scientists today. Statistics, the 
applications of statistics in the social sciences, and the quantitative and mathematical 
training of social scientists are all rapidly changing. Notions of decision making, of 
stochastic processes and of models in the social sciences are spreading not only among 
the expert technicians but also among businessmen and practicing social scientists. 
A new field, management science, is being developed. These ideas should be intro- 
duced to make the reader aware of the increased scope of statistical reasoning that is 
now emerging. 

There are a few places in the treatment of practical questions and illustrations with 
which this reviewer takes issue. Unfortunately, even a brief discussion of these points 
requires considerable space. But the main point is that in order to discuss illustrative 
material taken from research investigations on the level of an introductory course 
the author has often oversimplified and, at least twice, altered the research. It seems 
regrettable that this was done since the research benefits are thus largely lost. 

Two errors are made in connection with the applications of sampling in practice. 

The book refers to a study “Child Training and Personality: A Cross-Cultural 
Study” by John W. M. Whiting and Irvin L. Child, 


“ _.. based on data obtained from the Cross-cultural File at the Institute of Human 
Relations of Yale Wniversity. This file contains ethnographic reports about various 
primitive societies or cultures that have been observed by anthropologists. If a culture 
or society is regarded as a unit, if one thinks of the population of all primitive cul- 
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tures that have existed or do exist at the present time, and if there is no reason to 
believe that the anthropologists have been unduly ‘selective’ in choosing such cultures 
for observation, then the cultures included in this file can be regarded as a simple 
random sample of human cultures in general.” (McCarthy, p. 202) 


If I understand the last sentence, the author is suggesting that unless one has 
reason to believe that those selecting a sample have been “unduly selective” one may 
regard the sample as having been obtained by simple random sampling. And in par- 
ticular one could easily forgive any anthropologist who assumed on the basis of the 
above paragraph that he was being told that the Cross-cultural File could be regarded 
as a simple random sample if he is willing to make a few assumptions. 

It is worth pointing out that anthropologists do not claim that the cultures in- 
cluded in the File constitute a simple random sample of cultures. Originally it had 
been hoped that the sample would be a stratified random sample but various factors, 
such as the availability of good ethnographic reports, the availability of funds for 
complete coverage of certain strata, special interest or lack of interest in certain cul- 
tures, and so on, have resulted in the File possibly constituting a good intermediate 
level for selecting probability samples of cultures. (This reviewer must confess dis- 
appointment that even in the most recent effort, “World ethnographic sample” by 
G. P. Murdock, American Anthropologist, Aug. 1957, pp. 664-87, it does not seem 
that satisfactory sampling methods were used.) Thus the author seems to go further 
than anthropologists themselves in suggesting that the File may be treated as a 
simple random sample. 

But even if anthropologists may know that they have been “unduly selective” in 
constructing the file, what about others who are not so sophisticated in their under- 
standing of what they do? After all, it is just incorrect to say that a belief that one 
has not been unduly selective in choosing a sample implies that the sample is random. 
A specialist in sampling may be able to judge that the lack of importance of a study, 
or its uses, or its necessarily small size, or the type of analysis contemplated do not 
require precise adherence to what the author has written in Chap. 6, but the reader 
of an introductory text does not know how far he can go. For this reviewer, the 
dividing line has generally occurred before confidence intervals are computed, as they 
are in this illustration in Chap. 8; if the user of the data wants confidence intervals, 
he should select his sample so that he can calculate them. 

The other error in sampling occurs in connection with the selection of clusters. 

The book contains the following statement as an illustration of sampling: 


“For example, organizations have selected samples of business establishments (scat- 
tered over the United States) by first choosing a sample from a list and then associat- 
ing with each chosen establishment those three or four additional establishments 
which are geographically closest.” (p. 275) 


To follow the procedure embodied in the above excerpt would bias the sample 
against outlying establishments. For example, if one wished to select two of three 
establishments on a single street by this method, the middle establishment would be 
certain to be in the sample, the two outlying establishments would each have prob- 
ability, 4, of being in the sample. Usually, outlying establishments differ from those 
in the center of an area. Furthermore, no mention is made of how one determines 
geographic closeness and the various biases that the answer to that question may 
introduce in the case of establishments such as factories or farms which occupy large 
areas. It may be the bias is sometimes small in practice, but this is a question to be 
settled for each individual study. 

Another point on sampling is that after discussing (p. 44) a survey intended to 
provide estimates of proportions that are then used, together with data from other 
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sources to estimate frequencies, the author remarks, “If desired, it is also possible to 
design samples thet will provide approximations to these population numbers without 
recourse to external data.” In practice, the cost of surveys intended to provide fairly 
good estimates of frequencies without use of external data is so high that they are 
rarely made in the United States except by the major collecting agencies. 

Let us now consider a few examples of how the illustrative material is handled. As 
mentioned above, much of that material is taken from investigations in the social 
sciences. This reviewer is critical of the book’s treatment of illustrative material be- 
cause the greatest real advantage of using “live” material, namely, that one can see 
how realistic problems are studied, has not been achieved. 

For example, at various points the author has used data taken from sample surveys 
as illustrative material. Although these samples were not simple random samples, the 
author after stating, “Assuming that the . . . constitute a random sample .. . ” pro- 
ceeded to analyse the data as though the'samples were random. Nowhere is it sug- 
gested that such an analysis may be incorrect since the sampling methods are invari- 
ably more complex than simple random sampling. 

But there are two illustrations that have more serious defects. The following excerpt 
is from the discussion of the Whiting and Child study. 


“One of the items of interest to Whiting and Child was the extent to which primi- 
tive cultures explained bodily illness by either soul loss (i.e., a person gets sick because 
his soul has left his body) or spirit possession (i.e., a person gets sick because an alien 
spirit has entered his body). Accounts of 75 cultures from the file were examined, and 
38 showed evidence of using such explanation of bodily illness while 37 showed such 
explanations were not used. What would be the 95 per cent confidence limits for the 
true fraction of primitive human cultures that did use this type of explanation?” 
(McCarthy, p. 202) 


Before commenting on the above excerpt from this book, I should like to stress 
that I am criticizing neither the Cross-cultural File nor the monograph by Whiting 
and Child. Each contains much valuable information. 

No mention is made in the excerpt concerning how the 75 cultures were selected. 
But, by implication, we would assume that the 75 cultures are a simple random sample 
of the File and hence of all cultures. But Whiting and Child say (p. 48), “The source 
material consists of extracts from ethnographic reports about the culture of 75 differ- 
ent primitive societies which were selected and used in our study simply as being the 
75 for which the necessary material (on child training) was available.” Also of the 75 
cultures, 65 came from the File, 10 others were used because reports on them had 
“especially full information on child training practices” (p. 49). Thus, the 75 cultures 
were not a sample, simple random or any other, from the File. And here another very 
difficult problem of anthropology comes to fore. Anthropologists cannot easily get 
additional information concerning cultures. Ethnographic reports are not based on a 
standardized questionnaire. Often one knows only that the lack of mention of some 
custom or practice means that it was not reported—it is rare and may be erroneous 
to report that a custom or practice does not exist; it may only not be identified. Thus 
from the questionnaire point of view, the ethnographic reports not only yield large 
numbers of unknowns; the possible answers are likely to be biased as well. Conse- 
quently decisions to include societies in the sample on the basis of availability of 
information may bias the conclusions; perhaps the availability of information reflects 
the importance of the subject of study to either the culture or reporter. 

The author says that one of the items of interest to Whiting and Child was the 
extent to which a certain explanation of bodily illness is used by primitive cultures. 
That is not correct. Whiting and Child were interested in the relationship of child 
training practices to the explanations of bodily illnesses. As a result they acted so as 
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to achieve the division of the 75 cultures into 2 almost equal groups, 38 cultures 
having the spirit loss or possession explanation, and 37 not having such an explana- 
tion. Let us document this assertion. 

In order to have data of high quality to use in their study, Whiting and Child had 
two judges read each of the 75 ethnographic reports to determine the explanations 
of illness used by the society that was the subject of the report. They say (p. 154), 
“If our two judges agreed in finding evidence for the explanation, or any of the ex- 
planations, described in an item, for a given society, that item was considered to be 
present in the culture of that society.” Had Whiting and Child not wished an equal 
division of the 75 societies, it is obvious that the above procedure would have pro- 
duced a downward bias in the number of societies having that item present, but 
(p. 127) “We... placed the cases of disagreement (between the two judges) in the 
‘present’ or the ‘absent’ category according to which would come nearest to dividing 
the total number of societies into two groups of equal size.” 

Thus it is clear that Whiting and Child were not interested in estimating “the true 
fraction of primitive human cultures that did use this type of explanation.” Rather 
they effectively required that about 50 per cent of the societies do so. Consequently, 
the confidence limits computed in the illustration have no meaning. Furthermore, 
the general outcome of all these remarks is that the implication in the book being 
reviewed (p. 202) that the “data have been obtained by a procedure that conforms 
reasonably well to the requirements of a binomial probability model” also is incorrect. 

Let us turn to another illustration taken from social science research. As an i's 
lustration of the binomial distribution, the author considers (pp. 182-3, 207-13, 
220) some research done by Dymond on empathy; in particular the question is 
whether empathy exists. One of the more important problems of statistical applica- 
tions is the formulation of the statistical hypotheses associated with hypotheses con- 
cerning the real world. In the present instance, the author discusses the test of the 
hypothesis that a given individual has empathy (pp. 207-10) and finds that for a test 
of that hypothesis having the desired power it would be necessary to have the indi- 
vidual evaluate 19 other individuals (p. 212) rather than the 6 individuals that Dy- 
mond asked each person to evaluate. Only later (p. 220) does the author mention 
that the use of 20 rather than 7 person groups would greatly alter the experiment if 
not make it impractical. But the basic difficulty is that the author deals with the 
existence of empathy in terms of deciding whether a specified person has empathy 
rather than in terms of deciding whether people in general have empathy. To consider 
the latter question one may perform an experiment consisting of several groups of 7 
individuals as did Dymond and thus arrive at a test having the desired statistical 
properties. 

This illustration could have been used as a reason to discuss the interplay among 
the general hypotheses to be tested, the specific hypotheses that are-tested, and simple 
experimental design. 

It is commendable that the author included illustrative materials from research 
investigations in an introductory book. It is unfortunate that the simplifications of 
this material so greatly reduce its usefulness. 


Elementary Business and Economic Statistics. Alva M. Tuttle. Néw York: McGraw-Hill 
Book Company, Inc., 1957. Pp. xiii, 663. $6.75. 


Joun Neter, University of Minnesota 


pore Elementary Business and Economic Statistics is intended as a text for an 
elementary business and economics statistics course which has no mathematics 
prerequisite except perhaps a course in algebra. The author states that the material 
included is ample for a full-year course in business statistics, if it is covered thor- 
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oughly, and that the text can also be used for shorter courses by a suitable choice of 
topics. 

Tuttle states in the preface that an elementary business statistics course should not 
be devoted exclusively or almost exclusively to problems of statistical inference. Con- 
sequently, more than one-fourth of the text consists of the “descriptive” topics of 
classification of statistical data, sources and collection of statistical data, tabular 
presentation, ratios, and graphic presentation. Following these initial chapters are 
ones on frequency distribution, averages, and dispersion. Tuttle then has one chapter 
on elementary sampling and statistical inference, which is followed by chapters on 
index numbers and correlation. The book concludes with a set of four chapters on 
time series anatysis. 

It can be seen from this description of the text’s structure that the general coverage 
is similar to that of a number of other books in this field. Sources, collection, and classi- 
fication of statistical data are developed rather briefly, although it would seem that 
these are important aspects of the “descriptive” part of business statistics. The chap- 
ter on tabular presentation contains almost exclusively information on the con- 
struction of statistical tables, but very little on the uses and analyses of these tables. 
The same criticism applies to the chapter on ratios. Numerous types of statistical 
ratios are defined, but there is almost no discussion of the meaning of these ratios, 
their usefulness, and their limitations. 

Graphic presentation is discussed in three chapters. Again the major emphasis is 
upon the mechanical aspects of constructing graphs, with less attention being given 
to the interpretation of graphs and their uses and limitations in management. 

The major part of the discussion of averages and dispersion is devoted to computa- 
tional features. For instance, almost four pages are used to explain the computation 
of quartiles from ungrouped data when there are four to seven items in the array; 
the significance of these calculated measures is not considered. The normal curve is 
discussed in connection with the standard deviation, with no transition to probability 
distributions, although probability interpretations are drawn. 

The chapter on sampling and statistical inference contains a discussion of interval 
estimation and testing hypotheses, principally with respect to population means and 
proportions. Various incorrect statements mar the explanation of statistical infer- 
ence, and a number of other statements could easily give the student erroneous im- 
pressions. For instance, the impression is given that there is no statistical sampling 
theory for sampling without replacement from a finite population (p. 269). It is 
stated without qualification that, when sampling a nonnormal population, samples of 
size 30 are sufficient for the distribution of the sample mean to “very closely approxi- 
mate a normal distribution” (p. 274), and the impression is given that in sampling 
nonnormal populations the theoretical results for the mean and standard deviation 
of the distribution of the sample means apply only if the sample size is 30 or more 
(pp. 274, 279). 

A level of significance of .01 is explained as meaning “that one out of 100 of our 
statistical decisions made at this level, in the long run, will be a Type I error” (p. 
278), which does not convey clearly the conditional nature of the probability, namely 
if the null hypothesis is true. Incidentally, despite explaining Type II errors, Tuttle 
does not develop the notion of the power or operating characteristic curve of a test. 

Several times, Tuttle speaks of “testing of a sample mean when the mean and 
standard deviation of the universe with which the sample is to be tested are known, 
or can be assumed by some theory or appropriate hypothesis” (pp. 289, 293); this 
seems to be unfortunate choice of language, as it gives the impression that the popula- 
tion mean is already known. An incorrect statement about the unbiasedness of the 
sample standard deviation (divided by N - 1) asan estimate of the population standard 
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deviation is made (p. 290), the Poisson distribution is called a continuous distribution 
(p. 300), and the impression is given that the results for the mean and standard 
deviation of the binomial distribution are valid only if Np is greater than 5 (p. 300). 
In the discussion of control charts for measurements, it is said that the control chart 
commonly used is either an X or an R chart (p. 306), which gives the impression that 
both are not used together. 

In one illustrative example (pp. 279-83), the time required by experienced workers 
to perform an operation in a plant has a known distribution. To test whether a new 
procedure would speed up the process, a single worker is selected to perform the new 
operation a number of times, his results are compared to the old process mean, and 
it is concluded that the firm should adopt the new process (p. 283). Differences be- 
tween workers are not discussed. In another example, the hypothesis Ho:u <yo or 
Ho: >o is selected depending upon whether the sample mean is less than or greater 
than yuo, but no correction is made for the level of significance actually encountered 
(pp. 293-95). 

Index numbers and time series analysis are discussed along conventional lines, in a 
straightforward manner. This reviewer, for one, would have liked to see more em- 
phasis on the uses of index numbers and time series analysis in business, but this is a 
question of degree where personal judgments differ. 

Correlation is treated mostly from the descriptive point of view, and only at the 
end of the chapter in a brief section is the bivariate normal model discussed together 
with some tests of hypotheses. This latter material is so condensed, however, that it 
may not offer students substantial insights into regression and correlation models 
and related problems of inference. 

All chapters are followed by a section of questions and problems, which should be 
helpful for class discussion and laboratory work. 

Tuttle’s argument that a first course in business and economic statistics should 
include not only statistical inference but also such “descriptive” topics as sources and 
collection of statistical data and tabular and graphic presentation is received sym- 
pathetically by this reviewer. These subjects are not self-evident to most students and 
yet are encountered repeatedly in business and in economic analysis. Students should 
learn about nonsampling errors and biases arising in the collection and processing of 
statistical data so that such data are handled with proper perspective. Knowledge of 
statistical tests concerning, say, a population mean can be misleading if it is not sup- 
plemented by an understanding as to when the mean is a relevant measure for a 
given problem. 

Thus, a careful blending of descriptive and inferential materials seems to be called 
for. This, however, is not present in Elementary Business and Economic Statistics, in 
this reviewer’s opinion. Descriptive topics are too often discussed with great emphasis 
on mechanical features, and little on significance, uses, limitations, and relations to 
problems of inference. Furthermore, statistical inference is treated too sketchily, 
and its presentation is the weakest part of the book. 

These criticisms are in large part the result of differences in point of view. It should 
therefore be added that Tuttle has followed the lead of many other books in this 
field, and, aside from the discussion of statistical inference, has done a satisfactory 
job of presentation within his framework. 


Experimental Designs, Second Edition. William G. Cochran and Gertrude M. Cox. New 
York: John Wiley and Sons, Inc., 1957. Pp. xiv, 611, +5 tables. $10.25. 


HE first edition of this book was reviewed by Palmer O. Johnson in the September, 
1950 issue of this Journal (Vol. 45, pp. 454-55). The framework of the book is un- 
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changed, but a good deal of new material has been added, and some topics have been 
treated more briefly. 

The added material includes two new chapters, one dealing with fractional replica- 
tion of factorial experiments, the other with factorial experiments in which the factors 
represent quantitative variables measured on a continuous scale. At the end of 
Chapter 9 there has been added an index to incomplete block designs now available. 
A number of sections have been added to various chapters. 

“Topics that are presented more briefly are sequential experimentation . . . , the 
testing of effects suggested by the data ..., the problem of making several tests of 
significance in the same experiment ..., Yates’ automatic method of computing 
factorial effect totals ..., additional standard error formulae for split-plot experi 
ments ... , the effects of errors in the weights on the recovery of inter-block informa- 
tion ..., and the use of balanced incomplete block designs in taste and preference 
testing... .” The numbers of new chapters, sections, and tables carry the letter a 
to identify clearly the material that is new in this edition. 

W. A. W. 


An Introduction to Probability Theory and Its Applications, Vol. 1, Second Edition. 
William Feller. New York: John Wiley and Sons, Inc., 1957. Pp. xv, 461. $10.75. 


HE first edition of this volume was reviewed by David Blackwell in the September, 

1951 issue of this Journal (Vol. 46, pp. 381-82). In a Preface to this second edition, 
Feller explains that numerous communications from readers of the first edition 
stimulated him “to think of improvements and to collect better examples and 
exercises,” which “will make for easier reading and teaching from the book.” The 
Preface now contains a page of suggestions on accommodating the book, and courses 
based on it, to “the manifold needs of readers with divergent backgrounds, in- 
terests, and degrees of mathematical sophistication.” “Space saved by streamlining 
made it possible to add new material and to integrate the old third chapter with 
chapter II. New emphasis is laid on waiting times, a topic now serving as a unifying 
thread throughout the beok. . . . Chapter III is entirely new. It illustrates the power 
of combinatorial methods by deriving in an elementary way important results previ- 
ously obtained by advanced analytical tools. The results concerning fluctuations in 
coin tossing show that widely held beliefs about the law of large numbers are fallacious. 
These results are so amazing and so at variance with common intuition that even 
sophisticated colleagues doubted that coins actually misbehave as theory predicts. 
The record of a simulated experiment is therefore included... . A new stress on the 
essential unity of recurrent events and Markov chains permitted improvements and 
simplifications, but at the cost of a change from the terminology of the first edition.” 

W. A. W. 


Introduction to Business. John R. Craf. New York: Henry Holt and Company, 1957. Pp. 
582. $5.75. 


Harry V. Roserts, University of Chicago 


— relevance of this text to readers of this Journal is a 24-page chapter called, 
“Statistics as a managerial tool.” This review is mainly concerned with that 
chapter, but it is desirable to describe briefly the purpose and coverage of the entire 
book. The author states it in this way (Preface): 

“Introduction to Business, as a basic text, is designed to familiarize students with the 
major areas of American business and how business is organized and operated. 

“Other objectives are: (1) to acquaint the student with the role of management; 
(2) to explain the economic setting in which business operates; (3) to introduce the 
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business major and others to the problems of business; (4) to develop a vocabulary 
of business terms; (5) to provide a fund of facts upon which specialized courses in 
business may be built; and (6) to furnish information as to types of positions in which 
business majors may be employed and how to go about securing a position in business. 

“Theoretical concepts have been excluded as they may be more properly incor- 
porated in a treatise on principles of economics. . . . This text, therefore, projects itself 
immediately into a discussion of types of businesses, how they are differentiated from 
one another, and how they are organized. Subsequent sections are devoted to Man- 
agerial Controls and Records, Banking, Finance, and Insurance, Production, Per- 
sonnel, Marketing, Business and Government, and Vocational Guidance.” 

The chapter on statistics is one of three chapters in the section on “Managerial 
controls and records,” the other two being “Accounting records” and “Business 
information.” 

The chapter on statistics provides a brief discussion of graphs and tables, averages, 
index numbers, time series analysis, and statistical quality control. There is a brief 
discussion of “Application of statistics to a specific problem.” At the end of the chap- 
ter there are “Words for study purposes,” review questions, individual assignments, 
and suggested readings. 

The discussion of tables and graphs gives a few simple examples. It commits the 
common fallacy of the misleading pictogram: the height of the largest of a group of 
people represents the number of stockholders in a given income group in spite of the 
fact that the eye tends to judge magnitudes in terms of area rather than height. 
(p. 145) The impression is thus conveyed that there are relatively few stockholders 
in the “over $10,000” group. 

Three pages take up the mean, median, and mode, including illustrations of their 
calculation and a few statements, mostly misleading, about “advantages” and “dis- 
advantages.” For example: 

“The deficiency of the arithmetic average is that the average itself is affected by 
items which appear at either end of the range.” (p. 148) 

“The principal advantage of the median is that, since it is determined by position 
and not by calculation, its value is not influenced by the items which appear at the 
end of the scale. It should be pointed out, however, that most statisticians do not 
deem the median to be representative when the number of cases involved in the 
series is small.” (p. 148) 

The discussion of index numbers deals with the idea of relatives for a single product 
and not with the problem of aggregation. The two numerical examples (pp. 150-51) 
contain eight computational errors (six due to incorrect rounding) and one of the 
rounding errors carries into the text. 

The time series section points out that business changes through time and briefly 
introduces the terms “secular,” “seasonal,” “cyclical,” and “random,” and concludes 
that “... time series analysis, the study of trends as they affect business, is no 
matter to be taken lightly.” (p. 153) 

The concept of correlation is introduced briefly, and the student is cautioned that 
“causal relationships must be present for correlation to be meaningful.” 

Three paragraphs on statistical quality control refer to statistical sampling and 
control charts without explaining either term except to say that “... the control 
chart is something like the runway to the airline pilot. On each landing, the pilot 
attempts to glide down the middle of the runway, ... ” (p. 154) 

The “Application of statistics to a specific problem” considers a hypothetical case 
of an unusually high accident rate. The mechanical steps of a study of this problem 
are briefly described, but no insight into the logic of the attack is given. The student 
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is told, for example: “Analysis takes different forms. The data may be analyzed 
chronologically, geographically, or quantitatively. Analysis of data by the qualitative 
method is also possible in a number of instances.” (p. 156) 

Here are three of the ten review questions: “2. What is the principal disadvantage of 
the arithmetic average? Its principal advantage?” “6. Why do you believe there is 
not a more widespread use of statistics in business?” “10. Suppose you were asked to 
explain the term ‘statistics’ to one who had no knowledge of this area of business. 
What would you tell him?” (p. 157) 

Here is one of the individual assignments. “3. We hear much of fluctuations in 
business activity. Prepare a report listing reasons for fluctuations in business ac- 
tivity.” 

Even if we accept the author’s objective of “familiarization” to the “exclusion of 
theoretical concepts,” his chapter on statistics is bad. As the previous paragraphs 
have pointed out, it is frequently misleading in its treatment of statistical terms. It 
is dull. It gives an unrealistic picture of the use of statistics in business, as illustrated 
by the following paragraph: 

“World War II and the Korean War paved the way toward a better understanding 
of the pie or circle chart. Thousands of men who were taught to compute artillery 
fire data learned that a circle contained 360 degrees. Some of these men, upon return 
to their civilian pursuits, recognized the possibilities of the use of the circle in pre- 
senting data, and the use of the circle chart has increased.” (p. 144) 

These criticisms, however, tend to obscure the fundamental objection: the failure 
to introduce ideas to illuminate details and so present a stimulating, though neces- 
sarily brief, introduction to “Statistics as a managerial tool.” 


Statistical Abstract of the United States, 78th Annual Edition. Bureau of the Census, 
Department of Commerce. Washington, D. C.: Government Printing Office, 1957. Pp. 1071 
with 1185 tables and 40 charts. $3.50. 


CCORDING to the Preface, “Worthy of special mention in this edition is the pre- 
A sentation of summary statistics from the 1954 Censuses of Agriculture, Business, 
Manufactures, and Mineral Industries conducted by the Bureau of the Census... . 
Particularly affected were section 24, Agriculture—General Statistics; section 25, 
Agriculture—Production and Related Subjects; section 30, Manufactures; and sec- 
tion 31, Distribution and Services. 

“...A number of tables relating specifically to national banks and te gold stock 
and coinage were omitted in order to permit the inclusion of additional information 
for all banks and for commercial banks. 

“... among the new tables appearing in this edition are... tables relating to 
atomic energy programs and activities, investment in and capacity of Federal power 
projects, 1956 national election campaign expenditures, foreign travel by United 
States residents, and smoking behavior patterns of population groups. A total of 92 
new tables have been introduced in this edition. 

“Among tables shown in the last edition, 83 were emitted from this issue. Major 
omissions occurred in the Banl gz and Finance section affecting figures on loans, 
securities, capitalization, assets and liabilities of national banks, and on gold stock, 
coinage, and bullion value; in the Distribution and Services section affecting 1948 
Census of Business data; in the Construction and Housing section affecting 1950 
Census of Agriculture data. Some of the information in the omitted tables was ab- 


sorbed in other tables still present.” 
VW, A. W. 
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The Bivariate Normal Probability Distribution. D. B. Owen. Sandia Corporation; available 
from Office of Technical Services, Department of Commerce, Washington 25, D. C. 
$0.65. Paper. 


GertTrupe Bianca, Air Research and Development Command 


= report contains an extensive and well planned table relating to the bivariate 
normal probability distribution. In addition, there is a mathematical introduction 
giving the important properties of the function tabulated and of related integrals; 
there is a section on interpolation in the tables, and some examples showing the type 
of problems in which the functions are useful. The text includes a discussion of the 
trivariate normal and higher dimensional normal integrals, and a bibliography. 
Let 

L pe eMorage 

Th) = =f ai 
*“p,2 (a%+y2-tray /2-2*) 
L(h, kyr) = f f - — § 
k vr 2rVl1 — rt 





Then it can be shown that 
L(h, k, r) + T(h, ©) + T(k, «) bmi T(h, u) — T(k, v), 


where 


u=k—rh/[A(l —r)¥2]; vo = h — rk/[k(l — r)/], 


Thus' L(h, k, r), which depends on three parameters, can be evaluated from T(h, k). 
Moreover, there are relations connecting T'(h, k) with T(h, —k), with T(k, h), and 
with T(kh, 1/k). Hence, it is necessary to tabulate the function only for all values of 
h with k between zero and unity, at the expense of introducing some auxiliary com- 
putations. The author gives the following tables: 


Appendix A. T(A, a); h=0(.01)3.5(.05)4.75; a=0(.025)1 and «; 6D with first dif- 
ferences in both directions. [At h—4.75 T(h, a) =0 to 6 decimals for 
the range covered.] 

Appendix B. Table of 7(0, a)=are tan a/2x, with first differences a=0(.005)1; 6D 

Appendix C. Table of (1—p*)~? with first differences p =0(.005)1; 7D 


Historically, the most extensive table of the bivariate normal distribution function 
was given in various issues of Biometrika. Pearson unified all of these in his well known 
Tables for Statisticians and Biometricians Part II (now out of print). Pearson tabu- 
lated L(h, k, r), for both positive and negative values of r, calling the function d/N. 
In 1948 Evelyn Fix extended the table, but this work was available only for re- 
stricted circulation. Since, as noted, L(h, k, r) depends on three parameters, a related 
integral which depends on only two parameters was introduced by C. Nicholson in 
1943. This integral was in fact also known to Sheppard. Let 


h kelh g—(z*+y%) 
Vih, k) -f az f a. 
9 0 2 


Then it can be shown that 
Lth, k, r) = V(h, um) + V(k, um) + F, 
where 


u = hu, ») = hv, and u, v have been defined previously. 





1 In the Owen report, L(A, k, r) is designated by M(h, k, p) 
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Also 


1 1 arc sinr Bo te age 
F "TF [a(k) + a(h)] + ? a(z) = ae "du. 


In fact, there is a simple relation between V(h, k) and T(h, k) of the report dis- 
cussed here; namely 


arc tan k 
2x 


In view of the fact that the arc tangent is a well-behaved function everywhere, the 
functions V and T are equally satisfactory, for purposes of computing L(h, k, r). 
Nicholson gave the important properties of V(h, k) along with a table. This table was 
later extended by H. H. Germond. Finally, the National Bureau of Standards collated 
the tables of Pearson and E. Fix; it included an extended table of V(h, \) and V(AA, A), 
for 0.1SA<1. The collated tables were ready in preliminary manuscript in 1953, and 
they are now in press. For this reason a comparison of the NBS tables with those of 
Owen seems worth while. The Owen table comprises essentially 15,750 entries. The 
NBS tables of V (hk, &) contain about 9,300 entries, in addition to the major part of the 
book, namely the tabulation of L(h, k, r). Thus the Owen table is on the whole more 
extensive than the NBS tables for the function 7'(k, a), and interpolation in this table 
will at times be easier than in the NBS tables. On the other hand, there will be applica- 
tions where the NBS tabulation will undoubtedly be more convenient, and vice 
versa, since the parameters in the two tables are somewhat different. 

In format the Owen tables are reproduced by photo-offset from a machine tabula- 
tion. The type is clear and even, but in the opinion of this reviewer, the figures have 
been reduced too much. This was necessitated, no doubt, by the inclusion of the 
first differences, which perhaps might have been omitted without great loss. The 
Introduction, too, is neatly typed on an electric typewriter; but no attempt was made 
to draw any of the mathematical symbols that the typewriter did not have. Thus, 
some of the mathematical expressions appear quite crude, especially exponential 
terms. However, the text is everywhere clear and unambiguous. For an esthetic 
volume of tables, one will no doubt have to wait for Pearson’s new edition. 

The author is to be congratulated for contributing a useful table along with a very 
well written report. 


T(h, k) = — V(h, kh) + 


The Lognormal Distribution, with Special Reference to Its Use in Economics. J. Aitchison 
and J. A. C. Brown. University of Cambridge: Department of Applied Economics, Mono- 
graph 5. Cambridge, England: Cambridge University Press, 1957. Pp. xviii and 176. 
$6.50. 


GERHARD TINTNER, Jowa State College 


TS excellent monograph should be of great interest to mathematical statisticians 
and especially economic statisticians and econometricians. 

Consider a variable Y=log X which is normally distributed with mean yw and 
variance o*. Then X is said to follow the lognormal distribution. This is the two param- 
eter distribution, the parameters being the mean uw and the variance o*. The variate 
X is essentially positive and can take any value between zero and infinity. If X’=X—r 
follows the lognormal distribution we have the three parameter distribution which 
extends from r to infinity. But if X” = (X —r)/(@—X) follows the lognormal distribu- 
tion, we have the four parameter distribution, extending from r to 0. 

The authors give a concise but very useful history of the lognormal distribution. 
General properties of the distributions are discussed, and the genesis of the distribu- 
tion described. This is similar to the derivation of the normal distribution, small errors 
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being multiplicative instead of additive. The theory of estimation is very thoroughly 
presented and illustrated with the help of artificial samples. 

In biological statistical applications the lognormal distribution has been utilized 
in probit analysis, to which a chapter is devoted. The application of the usual statis- 
tical tests to the distribution is discussed. Truncated and censored lognormal distri- 
butions are also considered. An interesting chapter gives a survey of examples of the 
distribution, which cover a range from astronomy to philology. 

Perhaps the most valuable part of the book is the applications of the lognormal 
distribution to economic problems. First we have a short but very illuminating dis- 
cussion of the personal distribution of incomes. Some empirical evidence supports 
the theoretical reasons for using this particular type of distribution. Perhaps the most 
important idea is the application of the Lorenz diagram, which shows that concentra- 
tion of income is related to the parameter o* of the lognormal distribution, and the re- 
lation of the theory to Gini’s coefficient of mean difference. 

The most novel part of the book is probably the pages dealing with the application 
of the lognormal distribution to consumer’s behavior. The discussion centers around 
the Engel Curve, i.e., the relation between the quantity of the good bought and the 
consumer’s income. First the Engel curve is derived in lognormal form for an indivisi- 
ble good, and the theory is extended to divisible goods. For inferior goods we need the 
three parametric lognormal distribution. Questions of aggregation are considered and 
it is shown that under very reasonable assumptions aggregation result again in log- 
normal distributions. The parameters of the lognormal distribution are interpreted 
as the scale of measurement of consumption (parameter of saturation) and of income 
(parameter of cheapness). The income elasticity is a decreasing function of income. 
The influence of prices can be expressed by making the parameter of saturation a 
function of prices. Changes in preferences and the effect of household composition 
are also considered. 

The last chapter deals with computation problems. The tables in the book which 
facilitate estimation procedures and give an analysis of many artificial samples have 
been achieved with the help of an electronic computer. 

The bibliography is quite complete and contains 217 items. It is a great merit, 
that, in addition to the literature in English, works in Dutch, French, Italian and 
German have been considered also. I note only one important omission: There is no 
mention of the earlier work of H. T. Davis on the distribution of incomes (The 
Theory of Econometrics, Bloomington, Indiana, Principia Press, 1941, Chapter 2, 
pp. 17-53; The Analysis of Economic Time Series, Cowles Commission Monograph, 
No. 6, Principia Press, Bloomington, Indiana, 1941, Chapter 9, 386—443). 


Regression Analysis of Production Costs and Factory Operations, Third Edition. Philip 
Lyle. New York: Hafner Publishing Co., Inc. Pp. xvi, 204. $3.00. 


) tan of the second edition of this book by H. A. Freeman and Paul Peach 
appeared in the June and September issues of this Journal for 1947 (Vol. 42, 
pp. 330-31 and pp. 475-77). This third edition, prepared by L. H. C. Tippett after the 
death of the author, corrects the known errors in the second edition. Tippett has also 
“taken the liberty of inserting a very few interpolations which are enclosed in square 
brackets.” He has also added about two pages of very sound and useful comments in 
his Preface to the third edition. In these notes he points out that the book “possesses 
three main features: (1) it is an original contribution to the techniques of cost anal- 
ysis, (2) it contains an exposition of orthodox regression analysis (required in some 
degree for the full development of the cost techniques), and (3) it contains (mostly in 
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appendices) some ‘tips’ on mathematical procedures that have been useful in carrying 
out the analysis described.” W.AW 


Statistical Analysis of Stationary Time Series. Ulf Grenander and Murray Rosenblatt. 
New York: John Wiley & Sons; Stockholm: Almqvist & Wiksell, 1957. Pp. 300. $11.00. 


Lzo J. Tick, New York University and Lockheed Aircraft Corporation 


HIS monograph is mainly a collection of the authors’ results on statistical prob- 

lems of the estimation of spectral functions and regression structure when the 
residuals form a stationary time series. These results have only lately appeared in 
the periodical literature. 

Chap. 1 is a rapid survey of the basic probability notions in stationary time series 
theory. This is followed by a treatment of some least squares problems when the spec- 
trum is known. The major problem in this area is that of prediction. 

Chap. 3 reviews previous work done by statisticians in time series analysis. The 
authors refer to this work as parametric models studies as more restrictive assump- 
tions were made about the process, resulting in a finite-parameter model (e.g. auto- 
regressive scheme, independence, etc.). 

The remainder of the book concerns itself with the large-sample theory of spectral 
furction estimations and the distribution of some of these estimates. There is also a 
chapter dealing with applications. 

Chap. 7 is devoted to estimation of the regression structure. There are theorems on 
the asymptotic efficiency of ordinary least squares estimates as compared to Markoff 
estimates (weighted least-squares taking the correlations into account). The final 
chapter treats a variety of unsolved or partly solved problems, e.g. prediction when 
the spectrum is not known a priori. Almost all the results are for the discrete case. 

The subject of time series analysis has had enormous growth in recent years. 
Stationary processes has proved to be a useful model for an increasing number of 
physical phenomena (e.g. in a recent international conference on ship motions, 1/5 
of the papers were devoted to this topic). There is a considerable number of papers in 
the probability and engineering literature. However, in the newer areas of applications 
(ships, planes) small amounts of data are the rule and efficient inference making with 
known precision, i.e. statistics, has become a prime consideration. 

This subject has not received much attention by professional statisticians. This 
is probably due partly to the difficulty of learning the present state of the art (with 
due respect to publication lead-time.) This book certainly presents the present 
state and presents it well. However, the “art” to which I refer is that of developing 
new theory. This is a book for the mathematical statistician. It is not for the statis- 
tician who is confronted by data (more likely by an engineer bearing same) and is 
interested in prescriptions for their analysis. In this reviewer’s opinion there is not 
sufficient experience at hand to produce such a book. 

A large portion of the Grenander-Rosenblatt work is devoted to proofs and a good 
command of mathematical analysis is required of the reader. 

One of the major shortcomings of the book is its unevenness of presentation. The 
technical demands it makes on the reader vary widely. The first two chapters are 
quite abstract and a considerable knowledge of probability is required. The rest of 
the book requires only analysis. The proofs are difficult and hard to follow, but this is 
usually the case the first time theorems are proved. 

Statisticians and scientists will find the statements of the theorems interesting and 
useful in getting some ideas about variability in this area. The book contains a very 
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fine collection of problems at the end. The student will find much to challenge him 
and test his skill. Instructors should find this a useful text or supplement for a course 
on the mathematical statistical problems of stationary time series. 


Fractional Factorial Experiment Designs for Factors at Two Levels. National Bureau of 
Standards Applied Mathematics Series 48. The Statistical Engineering Laboratory. Wash- 
ington 25, D. C.: U. 8. Government Printing Office, 1957. Pp. iv, 85. $0.50. Paper. 


K. A. BRown EB, University of Chicago 


HE Preface states that this tabulation is a collective product of R. C. Burton, 

W. H. Clatworthy, W. 8. Connor, Lola 8. Deming, F. L. Miller, Jr., H. N. Petti- 
grew, and M. Zelen, the integration and presentation being due to Lola S. Deming. 
It covers designs up to the (1/256)th replication of the 2% experiment, requiring 
256 plots, i.e., no fraction more extreme than 1/256, number of factors greater than 
16, and number of plots greater than 256, is considered. The work consists of a list 
of the designs covering two pages, three pages on the theory of the subject, a bibliog- 
raphy, and 81 pages covering 126 designs. For fractions up to 1/32 the designs are 
written out in full; thereafter, the principal block is given along with one treatment 
combination for each of the remaining blocks. The work thus will show what designs 
are possible and will provide the solutions ready-made. 

In all cases several confounding arrangments are given, and this has led to the 
booklet’s being grossly inflated in size. For example, page 49 is devoted to the 1/16 
replicate of 12 factors in 8 blocks of 32 plots. The two succeeding pages give the same 
design in 4 blocks of 64 plots and 2 blocks of 128 plots. These three pages are identical 
except that blocks 1 and 2 of the first are grouped together to form block 1 for the 
second, and blocks 1 and 2 of the second (formed by blocks 1-4 of the first), are 
grouped together to form block 1 of the third. The addition of two lines at the head 
of any one of these tables would have taken care of both the other tables. Redundancy 
of this type occurs repeatedly and it would be tedious to list all the instances. 

Another form of redundancy is to list the doubly confounded arrangements twice, 
the second time the rows and columns being merely interchanged. For example, 
pages 36-37 have the 1/8 replicate of 11 factors doubly confounded in 32 blocks of 
8 plots and page 39 contains the same design written out, in full, as 8 blocks of 32 
plots. It would be wearisome to assemble all the examples of this procedure. 

This text strictly confines itself to listing the optimum arrangements within the 
specified limits. The authors are entitled to restrict their activities as they wish, but 
from the reader’s point of view it is regrettable that they did not give some dis- 
cussion of a number of closely related topics, including: 

(a) a description of the search techniques used for finding the optimum sub- 
groups. 

(b) randomization procedures to be used with these designs. In the case of doubly 
confounded arrangements, the reviewer would anticipate serious difficulties. Grundy 
and Healy (Journal of the Royal Statistical Society, Series B, XII (1950), 286-91) give 
a procedure for quasi-Latin squares which is adaptable to nonsquare doubly con- 
founded arrangements and which presumably will further adapt to fractionally rep- 
licated arrangements. 

(c) the possibilities of sequencing fractional replicates, i.e., the procedure in which 
one starts with a 1/2" replicate of n factors, and on the basis of the results of this ex- 
periment decides that certain factors can be neglected and on other factors more 
accurate information is required, so that one performs an additional set of treatment 
combinations so that, in combination with what one already has from the first experi- 
ment, one obtains a 1/2” replicate of n’ factors, r’ <r, n’ <n. 
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(d) the possibilities of mixed factorials of the type 4" x2". The use of only two 
levels for all factors is a serious restriction in that no information on the shape of the 
response is given, in the case where the facts are quantitative. Even worse, one runs 
the risk of picking one’s two levels on either side of a maximum, equally down from 
this maximum, so no effect is observed for this factor. 

However, it is clear that at the very modest price of 50 cents one cannot complain 
about the omission of possible additional material. The booklet achieves what it sets 
out to do, completely; that it does it about twice is no drawback. It can be confidently 
recommended to all who may have occasion to consider using these designs. 


Mathematical Economics. R. G. D. Allen. London and New York: Macmillan & Co. and 
St. Martin’s Press, 1957. Pp xvi, 768. $10.00. 


Kenneta J. Arrow, Stanford University 


ny reader of R. G. D. Allen’s Mathematical Analysis for Economists will not be 
surprised to hear that his new book is a masterpiece of exposition. There is the 
same care and patience in development and the same wealth of illustrative examples, 
as well as a miraculous collection of illuminating exercises for the reader. In all these 
respects, both of Allen’s books approach closely the quality of the book that I would 
surmise was their model, G. H. Hardy’s A Course of Pure Mathematics, which I take 
to be one of the greatest textbooks in any field of knowledge. 

The aim of the book is to present “some of the more important and systematic 
parts of mathematical economics.” It is not intended, as was Allen’s earlier volume, 
as a systematic development of some parts of mathematics, but rather of some parts 
of economics. It presupposes the mathematical knowledge of the previous volume; 
where this is inadequate, it presents additional mathematics in chapters separate 
from those dealing with economics. The general pattern is to pursue an economic 
topic until the need for additional mathematics becomes clear, then stop to develop 
the necessary mathematics, and finally to return to the original topic with the more 
powerful tools developed. As a piece of pedagogy, this seems remarkably successful. 

The book falls into two parts. The first 313 pages are devoted to macro-dynamic 
models, mainly of the post-Keynesian variety such as those of Hicks, Goodwin, 
Kalecki, and Phillips. Nowhere else will it be possible to find easily available such 
excellent studies of the mathematics of distributed lags, Laplace transforms, and 
the use of concepts derived from electrical engineering. Considerable stress, too, 
is placed on the economic meaning of the lags involved; most models implicitly 
assume that some markets are cleared while others are not, and the behavior of the 
model usually depends quite strongly on the precise assumption made. The role of 
money and prices is given very little stress; in this respect, Allen is representative of 
the literature he summarizes. 

The next 380 pages are devoted to price theory and general equalibrium, but from 
a modern point of view which makes extensive use of linear programming and input- 
output analysis; there is also a chapter on the theory of zero-sum two-person games. 
Matrix algebra becomes the dominant mathematical tool, and three chapters are 
devoted to its exposition, though calculus is used in the usual way in presenting the 
theory of consumer demand and also the theory of the firm (as an alternative to the 
linear programming formulations). The study of general equilibrium does not get 
beyond the usual counting of equations and unknowns; though the author recognizes 
that the equality of the two is not sufficient for the existence of equilibrium, he makes 
no reference whatever to the literature on this subject, which goes back to Wald’s 
wok in 1933. 
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A final chapter deals with the aggregation problem, completely along the lines 
developed by Theil. Indeed, it is only this last chapter in which the book deals with 
problems of direct concern to the readers of this Journal. It would be unjust to find 
fault with a book of 768 pages in which there does not appear to be any excess verbos- 
ity because it has omitted certain topics. Nevertheless, it may be useful to note, 
without critical intent, the virtually complete absence of precisely those areas in 
which economics and statistics most closely abut. Even on aggregation, while the 
coverage of Theil’s important work is excellent, there is no mention of the classical 
theory of index numbers or of its extension to functional index numbers (where use is 
made of an entire Engel curve) by Frisch, Wald, and others, although this topic is one 
of the most illuminating applications of consumers’ demand theory to empirical 
analysis. 

The relations of probability theory to economics are not touched on, except for a 
brief reference to the Bernoulli and von Neumann-Morgenstern-Ramsey utility 
theories. Economic behavior in risky situations is simply not discussed, though the 
theory of investment in both fixed capital (see Hart) and in inventories really has 
been greatly illuminated by an appreciation of the role played by the uncertainty 
of the future (actually, though theories about investment behavior play, of course, 
an important role in the macro-dynamic sections, there is no micro-economic discus- 
sion whatever). Finally, though the explanation of cyclical fluctuations as a stochastic 
process has been a recurrent, though perhaps not dominant theme, since the work 
of Yule, Slutzky, and Frisch, the macro-dynamic models are exclusively deterministic. 

Allen’s book will be an important reference for the mathematical economist, par- 
ticularly in the development of macro-dynamic models, and an indispensable adjunct 
in teaching. It has less significance for the statistician. 


Introduction to Mathematical Economics. D. W. Bushaw and R. W. Clower. Homewood, 
Illinois: Richard D. Irwin, Inc., 1957. Pp. xii, 345. $7.00. 


Victor E. Samira, Michigan State University 


sipE from R. G. D. Allen’s new book, recent textbook material for a course in 
mathematical economics has been limited to books designed for the teaching of 
mathematics rather than economics. Uuseful as these books have been, their orienta- 
tion and organization have been mathematical; bits and pieces of economics appear 
but in the form of illustrations of mathematical principles, rather than in their own 
right as the center of attraction. The student gains facility in mathematical manipula- 
tion and becomes acquainted with numerous functions useful as representations of 
economic phenomena, but he does not study economics as a coherent system of 
thought. 

Probably introductory texts in mathematical economics would have been less 
scarce were it not true that an appropriate order of presentation for the economic 
material is liable to be heartily inappropriate for the mathematical material. Bushaw 
and Clower have met this problem partly by dealing with the economics of markets 
(both isolated and interconnected) before taking up the theory of the consumer and 
the firm and partly by consolidating the mathematical tools needed in Part II of the 
book (“The Mathematics”). Thus Part I is permitted to be a treatise on economics. 
Whenever a mathematical technique is used for the first time the reader is referred to 
the appropriate section of “The Mathematics.” In general, the student who turns to 
the mathematics only when thus directed will find Part II so written that this order 
of presentation of the topics is feasible, although it is not the order a mathematiciaa 
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would have chosen if mathematical convenience alone had determined the arrange- 
ment. The student who prefers a more mathematical arrangement may work through 
Part II from its beginning. He will find it a concise text covering a good part of the 
mathematics used by economists, but not such topics as matrix algebra and set theory. 

The student of Part II is expected to have “a tolerable level of accomplishment in 
elementary algebra,” plus a good deal of maturity. “The Mathematics” begins with 
functions, graphs, and equations, but proceeds fairly rapidly through chapters on the 
differential calculus, definite integrals and Taylor series, quadratic forms and de- 
terminants, difference and differential equations, and maxima and minima. It is a 
book to be worked through, not simply read through. The demonstrations are com- 
plete, but concise. The exercises induce thought as well as provide practice. 

The student who possesses only college algebra and has only a modest aptitcde for 
mathematics may find the pace too fast. The treatment of difference and differential 
equations, for instance, is more abstract and more compressed than in Baumol’s 
Economic Dynamics (Macmillan, 1951). Nor will one find here such an abundance 
of algebraic functions appropriate for the use of the economist or as many opportuni- 
ties for practice in manipulation as one finds in R. G. D. Allen’s Mathematical Analysis 
for Economists (Macmillan, 1939) or Tintner’s Mathematics and Statistics for Econo- 
mists (Rinehart, 1953). 

The great advantage of Part I, “The Economics,” is that the economic problem re- 
ceives the principal emphasis. It is not brought in merely to illustrate the mathemat- 
ics. The text begins with the analysis of an isolated market, requiring only simple 
algebra. Then stock and flow markets are distinguished; investment demand is dis- 
cussed; and a model for an isolated market having both stock and flow demands is 
presented, the algebra meanwhile becoming slightly more complicated. Next come 
models of multiple markets and the explicit recognition of money. The calculus first 
appears in connection with a discussion of these models from the viewpoint of com- 
parative statics. 

The distinctive emphasis of the book is on dynamic models, in the sense of models 
in which the values of the variables are functions of time. The major problem of the 
book is the problem of stability, in isolated markets and in multiple markets. Thus 
differential and difference equations appear in Chap. III, with an analysis of con- 
vergent and explosive systems, and derivation of the stability conditions. By the time 
one has finished Chap. IV, on the dynamics of multiple markets, he has gained a 
rather thorough acquaintance with the problem. 

Consumer behavior and the theory of the firm do not appear until Chaps. V and 
VI. The theory of consumer choice is essentially that of Hicks’ Value and Capital 
(Oxford, 1939), but with the indifference map given an axiomatic basis. The authors 
develop the comparative statics propositions of Hicks’ mathematical appendix 
rather carefully before going on to present their own stock-flow model and a dynamic 
model of consumer choice. 

The theory of the firm, in addition to conventional input and output equilibrium 
analyses, presents a generalized flow model, a stock-flow model, and a simple dynamic 
model for output adjustment. The final chapter of “The Economics” includes two 
sections which deal with price determination in a market which is not purely com- 
petitive. This is a stimulating formulation, quite different from the usual analyses 
of imperfect competition. 

In one instance the spirit of innovation that characterized this book has extended 
to terminology. “Macroeconomics” in this volume does not refer to aggregative eco- 
nomics, but to the economics of a market or markets. The authors’ only venture into 
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aggregative economics, an eight-page appendix on the Keynesian system, attached to 
Chap. II, does not strike me as either helpful or successful. 

The book has very few typographical errors. 

It is apparent that this is not merely an intermediate theory text restated in 
mathematical terms. The volume raises questions which are significant for the 
economist; it deserves to be read for this alone. At a time when dynamic models, 
despite their importance, are largely neglected in theory texts, the emphasis of this 
book will help redress the balance. Unfortunately, many students who wouid have 
been fascinated by dynamics in the form of growth and cyclical models will be in- 
clined to consider the same dynamics applied to the fundamental question of market 
stability as too academic to deserve their attention. 

Since the book was directed toward an audience both better grounded in economics 
and more mature than the usual advanced undergraduate or beginning graduate 
student, it may prove a bit too taxing when used as a text for such students, if they 
are not already quite well trained in mathematics. Yet I feel that it will be so used, 
both because of its intrinsic interest and because it has the great advantage of making 
the necessary mathematics available to the serious student, along with the economics, 
and at the time when needed. 


Linear Programming: An Explanation of the Simplex Sone. Dakota Ulrich Greenwald. 
New York: The Ronald Press Company. Pp. vii, 75. $3.00. 


Davip Vauinsky, City College of New York 


EsPITE the vast volume of writing that has been done in the general field of 

linear programming during the past decade, few of the contributions to the 
literature have been full-scale books and, until the present volume appeared, none 
has been primarily directed to the interested but nonmathematically oriented 
student. 

Patently, it is impossible to encompass an adequate discussion of the whole com- 
plex field of linear programming within the confines of a small volume, and, the 
book’s title to the contrary, the author has not attempted to do so. His purpose is 
properly described in the volume’s sub-title, which aptly describes its content. 

Linear programming, which is a planning tool of management, is concerned with 
finding the optimum pattern of the variables and restrictions entering into industrial 
or military programming activities. The present volume deals with the “simplex 
tableau,” an algorithm which has proved to be the most useful of many techniques 
applicable to the solution of linear programming problems. It is particularly well 
adapted to small-scale problems, but, under some circumstances may be expanded to 
deal with problems of considerable size. 

Essentially, this book has been designed for senicr and graduate engineering or 
management students and makes no demands for advanced mathematical back- 
ground. It comprises an exposition of the simplex algorithm as applied to linear pro- 
gramming problems susceptible of solution by hand computation. 

The plan of the book is simple and straightforward. It limits itself to an exposition 
and demonstration of 19 problems involving application of the subject to work sched- 
uling, product mix under given selling prices and production limitations, minimiza- 
tion and nonoptimum programs, minimization of distribution costs, etc. Eight of these 
problem areas exemplify the use of the simplex tableau for their solution and are used 
for demonstration of method. The remaining 11 are presented for solution by the 
reader, and serve to familiarize him, by practice, with the use of the algorithm in 
practical problem situations. 
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_ The author suggests that the book may be used as a self-teaching manual or as a 

classroom text. Within its limitations, both uses are certainly possible, although the 
absence of any proofs and a somewhat inadequate presentation of principles would 
make it difficult for the student ever to understand all the reasons for the procedures 
he is instructed to employ in solving the problems presented. 

Because of the approach taken by the author, preliminary explanations are some- 
what long and slightly involved. A more direct mathematical treatment might, in 
fact, be simpler and easier to follow for the serious student genuinely interested in 
the subject. 

Since the book offers problems requiring only hand computation for their solutions, 
only a limited number of restrictions can appear in the examples. This is certainly in 
keeping with the author’s intent of providing a work within the scope of the non- 
mathematically oriented student. The reader should be aware, however, that the 
problems given by the author by no means exhaust the range of variables or restric- 
tions which may enter into the application of linear programming to management 
planning. Many of these require advanced techniques and machines and some of 
them exceed the available current resources of both. 

On the other hand, since the author intends his work to be used either as a self- 
teaching manual or as a classroom text, the limitation of variables and restrictions 
may serve as an advantage, since it becomes easier to understand and to follow the 
processes of problem formulation and solutica. 

Readers actually engaged in management f ‘actice may desire greater emphasis on 
the whole area of problem formulation within the bounds of linear programming 
solution than it has been possible for the author to include in the present work. Within 
the limits of its stated purposes, however, the book fulfills its promise and can be 
recommended. 


Naval Research Logistics Quarterly, published by the Office of Naval Research (Com- 

mander M. I. Rosenberg, USN, Managing Editor), distributed by the Superintendent of 

Documents, U. 8. Government Printing Office, Washington 25, D. C. Subscription price: 

$1.50 a year in the U. S. and Canada, $2.00 elsewhere; 50¢ for a single copy. Volume 1 
appeared in 1954. Approximately 300 pages per volume. 


4 ber Naval Research Logistics Quarterly is devoted to the dissemination of scientific 
information in logistics and will publish research and expository papers, including 
those in certain areas of mathematics, statistics, and economics, relevant to the over- 
all effort to improve the efficiency and effectiveness of logistics operations.” 

This policy has been followed by publishing diversified articles of interest to an 
audience much wider than suggested by the title. A rough classification of the eighty- 
one articles in the first three volumes follows. Twenty-seven articles of a nonmathe- 
matical nature appeared on the nature of logistics, specific field applications of 
logistics, and related economic questions. The theory of games, linear programming, 
and dynamic programming have been developed and exposited in thirty-one papers. 
New mathematical models useful in logistics, inspection, and inventory control have 
been presented in fifteen papers. Six papers emphasizing the role of statistics in 
applying logistics techniques and two papers in theoretical statistics have appeared. 
Also each issue has contained several book reviews. 

Nearly all of the articles are concerned with decision making and many of them 
involve uncertainty (probability). Because of this statisticians with a broad outlook 
will find Logistics Quarterly useful and fascinating. 

I. R. 8. 
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Work Sampling. Robert Heiland and Wallace J. Richardson. New York: McGraw-Hill Book 
Co., Inc., 1957. Pp. x, 243. $6.00. 


W. Epwarps Demine, New York University 


nis book strikes the statistician as a work that is authoritative in the subject 

matter, and enthusiastic in the use of statistical techniques. Work sampling was 
first used by Tippett in 1935 when he applied statistical sampling to the problems of 
the breakdown of looms in a textile*mill in England. It was 1940 when Robert Lee 
Morrow described Tippett’s technique (which Tippett had called a snap-reading 
technique) by the term studies of ratio-delay, and saw it as a method for establishing 
allowances for delays in connection with time studies. It is a pleasure to find in the 
preface this correct historical information. The term work sampling is probably a 
better term than ratio-delay because it more properly describes the technique, and 
it may be used for many types of quantitative studies of machines and of workers. 

Work sampling consists of observing at a sample of random instants what a machine 
or a man is doing; and if idle, the cause. Statistical theory, in its numerous ramifica- 
tions, is putting science into management and administration. Work sampling is one 
of the important uses of statistical theory. The main statistical tools in this book are 
the Shewhart charts and various tests of significance, to aid the discovery of causes of 
delay. 

Illustrations and case histories come from (1) a chemical plant; (2) industrial con- 
struction; (3) fixing production rates; (4) switch-engines in a railway yard; (5) proc- 
essing of invoices and vouchers; (6) utilization of personnel in plant maintenance; 
(7) allowances for delay in wage incentives; (8) utilization of equipment; (9) delays 
in the use of a crane; (10) layout of a service-station; (11) data-processing; (12) many 
other sources. 

The uses of work sampling are these: (1) as an aid in defining the problems; (2) 
to establish goals for supervision; (3) to determine the rature and extent of cyclic or 
peak-load; (4) to study needs of equipment; (5) to study requirements of manpower; 
(6) as a measure of performance; (7) to help establish job content as an aid to job 
evaluation; (8) to help supervisors organize their time; for example, the common 
complaint of too much paper work can be evaluated by work sampling; (9) appraisal 
of safety measures. Work sampling can be used to supplement safety inspection by 
the use of special codes to indicate hazardous working conditions; (10) appraisal of 
effectiveness; (11) observations of specific policies of management; (12) (with caution) 
determination of time standards and allowances. 

The authors are clear that the observations in work-sampling and in fact in any 
kind of sampling must be taken at random, and that randomness can only mean in 
practice the use of a table of random numbers. The authors eventually come to this 
point on pp. 111 ff. The short table of random numbers in the appendix will serve to 
show the nonstatisticians what a table of random numbers looks like. 

The book is written for the man who wishes to carry out work sampling. It shows 
suggested forms and complete procedures for many kinds of machines and factories. 
It shows also examples of presentation of results and what action they indicate. It 
emphasizes the need for careful planning, in order to collect information that will be 
of maximum utility. It warns the user to be on guard against errors of reporting and 
other nonsampling errors. The statistical procedures for design and for interpretation 
are clear and interesting. 

From the statistician’s view, one could offer a number of criticisms, but where the 
book ‘ails in the mastery of statistical theory, it more than repays in enthusiasm and 
in sensible application. 
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A long treatment on samples from a normal distribution is perhaps a trifle unfor- 
tunate in a work of this kind, as it may lead experts in the subject matter, unac- 
quainted with statistical theory, to suppose that the frequency distributions of delays 
dre or ought to be normally distributed, or (worse) that the techniques described in 
the book are not valid unless this condition is fulfilled. Actually, the utility of the 
Shewhart charts is relatively insensitive to the distribution of the original observa- 
tions. Moreover, many other statistical formulas and techniques do pretty well under 
the handicap of moderate skewness. 

The authors, in an attempt at rigor, say in the footnote on p. 95 that the standard 
deviation of an estimated proportion p’ is ~/p’(1—p’)/(n—1 where p’ is the “true” 
proportion (without an operational definition of the “true” proportion). The struggle 
with statistical theory continues: “When attempting to estimate p’ from a single 
small sample, however, there is very little precision possible anyhow, and use of 
n—1 will increase ¢, making the estimate even more unprecise!” 

it is a pleasure to recommend this book as one that is undoubtedly sound in sub- 
ject matter, and clear in the statistical procedures. The book is interesting and in- 
viting; the typography is excellent; the illustrations abundant. Finally, it is good to 
see statistical principles and techniques recognized as essential in textbooks on 
quantitative analysis of subject matter. A new day is about to break in administration 
and in management: this book will advance the hour of dawn. 


Control Estadistico de la Calidad (Tecnologia Industrial). E. Blanco Loizelier, Madrid, 
1954. Pp. xxiv, 435. 


Joree Arias B., Universidad de San Carlos de Guatemala 


I. this book the author has made a worthwhile effort to present to the Spanish 
speaking reader an exposition of the principal statistical methods used in quality 
control. Exclusive reference is made to industrial production, a field in which the 
author pioneered in his native country, Spain. 

The author expresses the hope that the publication of his book will facilitate the 
adoption of statistical control methods by interested industries. He does not specify, 
exactly however, for whom he primarily intended the book and this complicates the 
reviewer’s task. 

The book begins by summarizing the development of quality control methods in 
Europe, Asia, and the United States, but not in Spain. Chap. I, which introduces the 
study of frequency distributions, is couched in terms so elementary that the book 
seems to have been written for those without any knowledge of statistics. Chap. II, 
in contrast, defines the meaning of control, introduces the Shewhart postulates, and 
opens with the normal distribution and its characteristic function. In view of the 
book’s purpose, these chapters, which constitute an introduction to the study of 
probabilities, seem inadequate with respect to both extension and clarity. 

The reader is introduced to the control chart in Chap. III. There are quite a few 
graphs that look interesting but, unfortunately, these are neither described nor 
referred to in the text. Chap. IV, in which, incidentally, appear some misleading 
changes in notation, continues with the presentation of control charts for variables 
and attributes. 

The risk concept is lightly taken in Chap. V; VI is on simple, double, and se- 
quential sampling. 

Chap. VII, titled “Investigation and Experimentation,” includes the probability 
graph, regression and correlation, an introduction to analysis of variance, and a study 
of the “x?”, “t”, “F”, and “R” distributions for which the author employs the moment 
generating functions. The level of this treatment is not in accord with that of the 
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previous chapters. The condensation of so much material in so small a space (under 
30 pages) reduces the amount of theory to a minimum. 

Chap. VIII is on the application of the methods exposed to industry and gives 
many practical actual examples. 

The last two chapters, IX and X, are purely supplementary. The former discusses 
the human factor in statistical quality control and emphasizes the role of the in- 
specting personnel. The latter brings in the use of the slide rule, the Bender and 
Quickie computers, and other such auxiliary devices, with reference to the computa- 
tion of limits for the different charts. 

The book has two appendixes—a bibliography and a collection of tables. In the 
first appendix is a bibliographical reference cross-table for the individual chapters 
which gives the author and the page but omits the name of the book and its editor. 
(In general the bibliographical references which follow each chapter are limited in 
their usefulness because they do not include the editor’s name and the country of 
publication.) This appendix also contains a list of magazine articles (confined for the 
most part to North American publications) covering the periods 1949-1950 and 
1943-1949. It must be added that, although the book was finished in 1951, publica- 
tion was delayed until 1954. 

The last 160 pages contain some auxiliary tables for computing and graphing 
control limits, the Dodge and Roming tables, and the MIL-STD-105A of the U. 8. 
Army. 

The reviewer would like to bring out some points which would surely mislead 
anyone without some training in statistics. For example, a frequency curve is ad- 
justed through the projecting corners of a histogram. Also, the book says that one of 
the advantages of the median is that it “is based in the calculation of all observa- 
tions”; or that (in relation to the probability graph) “taking a logarithmic scale for 
the ordinate axis, we transform the accumulative curve of the theoretical normal 
distribution into a straight line.” 

The reviewer feels that this book contains much valuable material which, con- 
sidering the author’s theoretical and practical background, could be used advan- 
tageously in writing a better one. He feels also that such a book would be materially 
improved by a greater emphasis on theory and by a more complete interpretation of 
the problems presented. 


Economic Policy: Principles and Design. Jan Tinbergen. Amsterdam: North-Holland 
Publishing Company, 1956. Po. xxviii, 276. $7.00 (26.50 guilders). 


See review article by Kenneth J. Arrow, pp. 89-97 in this issue. 


Fluctuations, Growth, and Forecasting: The Principles of Dynamic Business Economics. 
Sherman J. Maisel. New York: John Wiley and Sons, Inc., 1957. Pp. xv, 552. $7.50. 


Emer C. Bratt, Lehigh University 


0 AN extent never before attempted in the study of aggregative economic change, 

this book places decision-makers in a central position. Wherever possible, neces- 
sary decisions are introduced and the process of reaching them is coordinated with 
aggregate economic change; it is true that the method could not be used for a con- 
siderable part of the book, necessitating resort to more conventional presentation. 
Also, parts of the presentation on decision-making seem but little related to economic- 
change problems. The book is well written and frequently sparkles with human 
interest. Where decision-making cannot be used, central analogies with well-under- 
stood human experiences are widely employed. 
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A difficulty the author faces is our ignorance with respect to the anticipatory 
process. As we learn more about anticipatory variables and the anticipatory process 
is more fully investigated, greater plausibility will be lent to simplified statements on 
how decisions can be derived from an understanding of aggregate economic change. 
But progress along these lines is likely to be slow and Maisel’s analysis will not 
quickly become obsolete. 

Some attention is given to all phases of economic fluctuations, economic growth, 
and seasonal movements. For the purpose of analysis the principal measurements 
deveioped are income and product data and useful comments are made on measure- 
ment and analysis in this area. As the writer states, it was necessary to sacrifice 
“greater depth in order to obtain wider coverage.” Centering on the decision-making 
process, the conclusions required are emphasized whi'e frequently the analysis neces- 
sary to reach the conclusions is only indicated. The more traditional approach is to 
emphasize the analysis and give only cursory attention to applications. Interest may 
well be generated by emphasis on decisions, and the average student may not object 
to a slurring over the analysis. 

A significant departure is the frequent use of cases. Some of the cases are superb, 
notably projection of gross national product for 1955 (pp. 411 ff.) and the case history 
of the 1954 recession (pp. 471-473). Some others are difficult to appraise because it is 
not clear how much they have been modified by the author for purposes of exposition 
and to what extent he would support the conclusions reached. For instance, the only 
presentation on projected secular growth of gross national product is a (presumably) 
hypothetical case which holds that future productivity rise might be anywhere from 
1.5 to 3.5 per cent (pp. 52-59); and of housing which holds that the full employment 
rate will be 8 to 9 hundred thousand a year (pp. 239-240). Many students will face 
such problems by similar case presentation in later life. Should they now have a fuller 
analysis of the methods used in developing the cases? 

It is not easy to integrate decision-making with aggregative economic change. One 
illustration is the position Maisel takes with respect to the tendency of businessmen 
to decide on investment changes after a movement has occurred in sales; he holds 
that there is a lag in investment in general (pp. 189-190), without noting that no such 
average tendency shows up in aggregate economic data or that the opposite conclu- 
sion was reached from the accleration principle. The book does not divide readily 
into conventional sections; one of the five major divisions is labelled forecasting. al- 
though only half of the pages relate to forecasting and a considerable part of the 
rest of the book relates to that subject. 

The fact remains, however, that Maisel has provided a new organization of major 
importance. It will have a wide influence on teaching and writing in the field of 
aggregate economic change. No claim would be made that new facts or basic analysis 
is introduced, but competent understanding is demonstrated. 


Quantity and Price Indexes in National Accounts. Richard Stone. Paris: Organisation for 
European Economic Cooperation, 1956. Pp. 120. $1.50. Paper. 


Martin J. Batter, University of Chicago 


_— work is a companion volume to the earlier work by the same author, A Stand- 
ardized System of National Accounts, and is built around the questions involved 
in obtaining an index of the real gross product of an economy. It is intended as a 
suggested basic text or guide for civil servants anc. others who are trying to construct 
such an index, and presumably should be judged mainly in terms of its usefulness and 
appropriateness to this end. There is no attempt to discuss the special problems con- 
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nected with other types of index numbers than those involving gross product, nor of 
the purposes which such other indexes may serve; although most of what is said is 
equally applicable to many or all types of index numbers. 

A question naturally arises as to whether it is worth while to build a discussion of 
index numbers entirely around the system of national accounts, when most of the 
users of the work will be interested in the construction and application of other types 
of index numbers as well. A chapter at the end, entitled “Familiar types of index- 
numbers and the system presented here,” touches lightly on the subject of “index- 
numbers in common use,” but is too terse to be of any help on the special problems 
concerning such other indexes. It is true that a more thorough discussion of other 
indexes would lengthen the book substantially; but on the other hand, their treat- 
ment in separate works must involve considerable duplication. 

Notwithstanding the limitation of its framework, this book contains a lot of excel- 
lent discussion of conceptual problems and problems of measurement in the con- 
struction of index numbers, most of which is not available elsewhere or is available 
only in bits and pieces in scattered articles and books. On the whole it is pitched at 
about the right level for its intended audience, although its very smooth style con- 
ceals many inconsistencies in its apparent assumptions about the level of training 
of the reader. (For example, in Chap. III, Sec. 5, the reader has to have explained 
the elementary characteristics of Laspeyres and Paasche indexes, while he is assumed 
in Sec. 5 of Chap. VI to know what is meant, in the statistical sense, by “the com- 
ponent .. . of F* orthogonal to S and T”; he is also supposed to be fluent in matrix 
algebra and statistical estimation. At many points, too, the reasons for choices made 
between alternatives or for analytical steps taken are very tersely or even vaguely 
stated for a work of this kind.) The strongest sections are Chaps. [V through VI and 
Chap. VIII, all dealing with problems of a very practical and perplexing kind; quality 
differences, intermediate services, seasonals, general measurement problems. These 
subjects are not developed here as fully as eleswhere, but the discussion here con- 
denses a lot of good sense into a small space. The only serious omission that occurs 
to the reviewer, in these chapters, is an adequate discussion of the units of measure- 
ment for services. The author asserts (p. 63) that if a motor car factory were set up 
in San Francisco which could produce cars there as cheaply as in Detroit “then this 
event will result in a fall in the price of cars in San Francisco relative to the earlier 
base period and not in a fall in quantity”; but he does not say how this will be handled 
in terms of the units of measurement of the output of the distribution sector (trans- 
port, retail, etc.). On the contrary, everything he does say on distribution cuts the 
other way. 

Two closely related chapters which seemed to the reviewer to go part way into a 
subject without going far enough are those concerning the economic theory of index 
numbers (Chap. ITI) and the distinction between real product and real income (Chap. 
VII). The first of these two chapters gives the reade. a tantalizing glimpse of some un- 
conventional approaches to the valuation of the social product and then brusquely 
tells him that index numbers (in this context meaning only weighted arithmetic or 
harmonic means) are not “without use or interest,” as measures of central tendency, 
notwithstanding their arbitrary definitions. This says nothing, however, about 
whether an arithmetic mean is better than, say, a geometric mean in this connection. 
The author does suggest modifying any analysis based on conventional index num- 
bers by taking account of known facts of consumer behavior, but he does not see 
any middle ground, in the actual systematic calculations of indexes, between con- 
ventional indexes thus interpreted and a full-blown constant utility index based on 
complete empirical studies of consumer demand and of the distribution of income and 
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of demand patterns. Chap. VII, which deals with the deflation of different income 
and product concepts such as consumption, saving, and investment, is handicapped 
by this hasty abandonment of the welfare aspects of index number theory. Partly 
as a result of this, and partly as a result of the intrinsic difficulty in interpreting the 
concept of real saving (or real investment), little is accomplished here beyond the 
suggestion that real income might be defined either to mean real factor inputs or 
command over consumer goods, and that in either case it cannot be expected to equal 
real product outside the base year. 

For the practitioner concerned with improving the accuracy, coverage, and ele- 
gance of a price or quantity index, however, this book should prove as valuable as it 
set out to be. The gaps mentioned above are, after all, relatively minor and can be 
covered by moderate recourse to the references provided at the end. It should be 
considered as a companion volume not only to A Standardized System, previously 
mentioned, but also to The Measurement of Production Movements, in whose writing 
Stone collaborated, which discusses many of the practical problems of measurement 
for index numbers in fuller detail. 


National Income Accounts and Income Analysis, Second edition. Richard Ruggles and 
Nancy D. Ruggles. New York: McGraw-Hill Book Company, 1956. Pp. viii, 452. $6.50. 


Davin I. MetseLMaN, National Bureau of Economic Research 


MM“ readers of the 1949 edition of this popular text will be pleased that the 
second edition builds upon its format and general approach to the content of 
most intermediate and advanced courses in national income. The new edition, how- 
ever, is no mere re-issue of a well-received textbook. It has added one author, Nancy 
D. Ruggles, wife of the original author, 5 chapters, and 100 pages of text. The most 
important of these additions is Chap. 5 and the discussion leading into it, in which 
the authors recast the U. S. national accounts. Several chapters have also been re- 
vised and brought up to date. 

The book is divided into two parts of equal length. Part I covers the accounting 
aspects of national income and Part II the analysis of national income. The chapters 
on national income accounting retain the strong points of the first edition and remain 
the skeleton around which the entire book is formed. This section is developed out of 
a brief exposition of the purposes and content of the principal accounts of the firm, 
the balance sheet and the income statement. The authors then move on to derive 
national accounts from the accounts of the firm, weaving into the exposition an ex- 
tensive discussion of the conceptual and operating problems of measuring and using 
the national income accounts. 

New material has made Part I more instructive to the college student and the 
user of the national income accounts, and more interesting to the social accountant. 
Social accountants should pay particular attention to Chap. 5 and its appendix in 
which the authors present their own set of national accounts and then compare 
them with current U. 8S. and U. N. systems. The Ruggles’ set of accounts suggests 
basic improvements in the U. 8. accounts which, if adopted, would greatly enhance 
their usefulness. In essence, the authors present a reclassification of the national 
income accounts into more analytically meaningful sectors and categories of ex- 
penditures and receipts. Their most important innovation is a sectoring of the econ- 
omy predominantly by function rather than by transactor, with sector accounts for 
consumers, producers, and government. In addition to this they reclassify expendi- 
tures by function so that the composition of and shifts among the principal compo- 
nents of consumption, investment and government expenditures can be observed in 
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the main body of the accounts. They further reclassify receipts of income and taxes 
by sector in which the payments originated, and by type of tax paid or factor service 
rendered. Thus, the accounts move towards a more functional recording of each flow 
of income and product on a simultaneous “to-whom,” “from-whom,” and “for-what” 
basis. There are many other challenging and useful innovations in the details of the 
accounts which deserve attention. Many of the innovations and details of Part I have 
been incorporated in the recent report of the National Accounts Review Committee 
of the National Bureau of Economic Research, which recommended basic revisions 
of the U.S. accounts.! 

Two appendixes which have been added to Part I are valuable because they sum- 
marize large bodies of information and offer a short course in national income account- 
ing. One 22-page appendix, largely a condensation of Part III and Part IV of the 
1954 edition of the National Income Supplement, describes the sources of data and 
estimating procedures currently employed by the Department of Commerce to 
measure U. S. flows. It also discusses the reliability of sources and methods. The other 
appendix compares the structures of the accounts of the U. S. and U. N., and those 
of the authors themselves as examples of alternative attempts to solve many of the 
general problems of national income accounting. Readers who have some acquaint- 
ance with social accounting may prefer this short treatment to the more extended 
one in the book. 

The chapter titled “Related Systems of Economic Accounting” contains a new 
section on the moneyflows or flow-of-funds accounting system and an extension of 
the exposition of input-output tables which appeared in the first edition. The treat- 
ment of complementary accounting systems and bodies of data would be improved by 
broadening the discussion to include wealth and savings estimates, possibly of the 
kinds developed by Raymond Goldsmith and the Securities and Exchange Com- 
mission. These omissions may reflect the theoretical presumptions of the authors 
whose analytical views are close to those of the Keynesians who focused their atten- 
tion almost exclusively on the income statement and tended to neglect wealtl. vari- 
ables or the forms which are taken by the flows of savings. 

Although the first half of the book is a valuable self-contained description of the 
accounts which record the behavior of national income, the second half of the book 
which sets out to present a framework for the analysis of national income behavior 
and related public policy implications is, in this reviewer’s opinion, subject to criti- 
cism. Additional data and discussion have been added to Part II, but the authors re- 
main too little cognizant of the changing face—since the late 1930’s—of the economy 
and of the theory useful to analyze its aggregate movements. In the authors’ econ- 
omy, wage rates bear little relationship to employment or output; changes in the 
quantity of money have little impact on the price level; and investment expenditures 
are largely independent of the very widest fluctuations in interest rates. On page 362, 
for example, we find that a rise in interest rates from 4% to 8% is associated with a 
decline in investment expenditures of but $2 billion from a level of $30 billion, or an 
interest elasticity of approximately 0.03. This observation hardly conforms to the 
United States experience with the higher interest rates of the past few years. Fur- 
ther, little more than perfunctory discussion can be found of the roles of financial 
variables. Thus, students will be forced to look elsewhere for the tools required to 
analyze such current problems as the inflationary effect of labor unions, the impact 
of contra-cyclical monetary policy, and the consequences of higher interest rates. 





1 The report of the Committee, The National Economic Accounts of the United States: Review, Appraisal, and 
Recommendations, is contained in Hearings before the Subcommittee on Economic Statistics of the Joint Economic 
Committee, 85th Cong., Ist sess., October 29 and 30, 1957, and has been reprinted by the National Bureau as No. 64 
in ite General Series. 
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Students must also look elsewhere for the content and the flavor of the exciting 
controversies of the past twenty-five years regarding macro-theory and public 
policy. The book presents little in the way of alternative theoretical or policy views 
and fails to develop adequately the intellectual backgrounds of the positions taken 
by the authors themselves. The 226 pages of Part II contain no suggested readings 
and but one minor footnote reference to the rich and vigorous literature. These 
bibliographical deficiencies are especially grave in view of the controversial nature of 
many of the areas the second half of the book attempts to cover and the advanced 
standing of the students to whom the book is primarily addressed. 

Those who require a good exposition of national income accounting will find one 
here. Many who also require powerful tools to analyze the contents of the filing 
cabinet will feel compelled to turn to other sources. 


The Demand for Aluminum: A Case Study in Long-Range Forecasting. Business Study 
No. 10. James E. Rosenzweig. Urbana: Bureau of Economic and Business Research, 
University of Illinois, 1957. Pp. 67. $1.00. Paper. 


Paut Boscuan, The Guardian Life Insurance Company of America 


ERE metal producers and consumers, government agencies, members of the 

financial community, and consultants of every description to empty their files 
under the letter A, no doubt a veritable deluge of forecasts of demand for aluminum 
would occur. No doubt, too, all the formal methods, such as trend projection, correla- 
tion analysis, end-use surveys, and so on, would be in evidence. It would be interest- 
ing to make a comparative study of the methods used and the results achieved. 
Unfortunately, the material is usually confidential or otherwise inaccessible. Thus, 
a major contribution of Rosenzweig’s study is to present forecasts for public inspec- 
tion. His purpose is to evaluate the relative merits of various standard forecasting 
techniques, unhampered—but also unguided—by any special economic research 
objective. The first chapter of the pamphlet summarizes the findings of the author. 
The next two, which are largely quantitative in character, discuss forecasting by 
trend projection and by ccrrelation techniques. The last chapter is largely qualitative 
and concerned with the “Outlook for Products made from Aluminum.” It culminates 
in a tabulation of present and future end-uses. 

After experimenting with various types of trend patterns, and with periods to 
base them on, the author settles for an exponential trend based on 1910 to 1954. It 
implies doubling of the tonnage of aluminum consumed every 7 to 8 years and results 
in a projection of 8.5 billion pounds of aluminum by 1965. Next, in the correlation 
approach, he compares first, total aluminum demand and later, imputed civilian 
demand with various business indicators and some of their major components. He 
also experiments with specific price measures and finally with “time.” Two annual 
series on price differentials between aluminum and competitive materials, based on 
comparable performance specifications (Appendix, Table 5, p. 67) are welcome addi- 
tions to the body of analytic price information. The end-use survey is a compilation 
of available information about the demand for aluminum by present and potential 
users; it contains a review of the outlook for consumption by the military, and by 
the building, transportation equipment, electrical equipment, appliances, and con- 
tainer industries. Tables 4 and 5 on page 61 summarize demand in 1954 and give pro- 
jections for 1965. The three approaches give results varying between 8.4 and 8.9 bil- 
lion pounds, compared with only 3 billion pounds in 1954. “All in all, the results of the 
three separate approaches coincide well enough to lend authority to forecasts of the 
demand for aluminum at a level two to three times that of 1954.” (p. 63) 

The procedures used provoke several comments. To begin with, it is questionable 
whether the three types of estimate are really independent. The correlation approach, 
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relating aluminum consumption to Gross National Expenditure, time, etc., is closely 
related to a simple extrapolation of the trend of aluminum demand. This is so not 
only because “time” is used in the correlation approach as an independent variable, 
but also because of the systematic relation between the secular trends in aluminum 
consumption and Gross National Expenditure. What about the end-use estimates? 
The discussion in Chap. IV is so broad that it supports the tabulated end-use estimate 
only in the most general way. Under these conditions mutually corroborative evi- 
dence is easily subject to common biases and should be evaluated with a good deal 
of caution. 

Questions arise also concerning some technical points. The author’s estimate of 
post-World War II civilian consumption of aluminum is an extrapolation based on 
the relation of total consumption to Gross National Expenditure during the pre-war 
years. At a later stage the relationship of the estimated civilian aluminum consump- 
tion to the several components of Gross National Expenditure is evaluated for the 
entire period and found to be close. The element of circularity in this procedure is 
obvious. 

Another point involves the use of aluminum prices in the estimating equation. 
Frequently this type of approach produces a positive sign for the price coefficient, 
because both prices and sales tend to fluctuate with the business cycle. The author 
obtains a negative coefficient of —.3, which he interprets as indicating “.. . a rela- 
tively inelastic price elasticity of demand...” (p. 26). Aluminum consumption 
showed a steeply rising, and prices (with little cyclical variation) a basically falling 
trend during this period. Thus the price coefficients in the estimating equation are 
mainly trend-determined and bear little or no relation to any short-term concept of 
“price elasticity of demand.” 

The failure to distinguish between short- and long-term relationships is also re- 
flected in the author’s selection of basic estimating variables. 

He finds that (deflated) Gross National Expenditure or its components provide a 
better estimating basis than the Federal Reserve Index of Production, and he bases 
this statement of the over-all regression between the variables. A segregation into 
cyclical and trend relationships would have led to analytically more meaningful for- 
mulations, and, perhaps also, to the use of variables that are economically more 
closely related. Rosenzweig discounts the importance of cyclical patterns for long- 
term forecasts. (p. 11) Yet his scatters, residuals, estimating equation, etc., show 
clearly the effects of the cyclical behavior of his variables. The need to distinguish 
clearly between short- and long-term relationships is particularly great in the articu- 
lation of forecasting models. The growth of aluminum demand from year to year is 
not uniform. During the period selected the growth of aluminum demand is dominant 
but within the confines of the upward sloping band, some cyclical ups and downs and 
a distinct break in the continuity of the trend, during World War II, are apparent. 
Whenever observations exhibit such characteristics, it is dangerous to assume that 
they are scattered at random around a smooth upward sloping line. In the case at 
hand, it may lead to an upward bias in the forecast, particularly if the projection 
covers an extended period. 

This brings us to a more basic point, that is, the formalistic character of the fore- 
casts based on such models as underlie the trend and correlation approach. Charac- 
teristic is the use of “time” as an independent variable, whether as catch-all for neg- 
lected factors or “designed specifically to portray the development of aluminum. ... ” 
(p. 24) The future of economic trends cannot be derived by “extrapolation.” It is 
essential to consider the prospects of the economic forces and of the institutional 
conditions which shape these trends. Without consideration of the technological and 
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market determinants of such trends, forecasts must, of necessity, be mechanical 
and subject to gross errors whenever major changes in applications, material sub- 
stitutions, ete., occur or are in the offing. In this context it might also be mentioned 
that the domestic scene does not encompass everything that bears on the future of 
aluminum demand. Embedding the analysis in the larger context of world supply and 
demand pressures would appear to be a necessary condition for realistic anticipation. 

Chap. IV, which contains concrete technical and market considerations, is a step 
in the right direction. Unfortunately, the necessary information for good quantified 
forecasts, in specific fields of application, is hard to come by. Maybe this situation 
indicates in what direction the next step forward should be made. Taking formal 
projections, like those here under review, as a point of departure, forecasters in the 
field of aluminum demand can now begin to perfect forecasting procedures and to 
prepare forecasts in their various field of special competence. This calls for an in- 
genious blending of knowledge in the fields of industrial engineering, economic anal- 
ysis, and statistical technique. If such endeavors could be carried through, published 
and finally integrated, then the forecasting of aluminum demand (and by analogy, 
that of other industrial commodities) would make significant progress. In the mean- 
time, Rosenzweig’s publication robs us of an excuse for further additions to the private 
files of formalistic forecasts. This alone is enough to earn him the thanks of all his co- 
practitioners in this delicate art. 


The Growth of Public Employment in Great Britain. Moses Abramovitz and Vera Elias- 
berg. Princeton: Princeton University Press, 1957. Pp. xiii, 151. $3.75. 


J. Wiseman, London School of Economics 


I view of the willingness of economists to put forward proposals about government 


economic policies, it is surprising to find how little attention they have paid to the 
actual size and economic character of the activities of governments, and to the way 
these things have changed over time. The book under review is one of a series of 
studies sponsored by the National Bureau of Economic Research with the object of 
remedying this defect. It is a parallel study to Solomon Fabricant’s earlier work in 
the series: The Trend of Government Activity in the United States since 1900. Unlike 
that study, the present book does not consider all the economic activities of the 
British Government, but is restricted to examination of government employment of 
labor. Government expenditure as a whole is to be treated in a subsequent study by 
A. T. Peacock and Jack Wiseman. As the authors themselves point out, labor statis- 
tics alone have serious limitations as indicators of economic change. But they are by 
no means without value, as this study shows. 

Although the book is concerned with the period 1891-1950, there is a preliminary 
chapter which describes the major developments in the growth of the British Govern- 
ment in the nineteenth century. Quantitative information about this period is scanty, 
and the authors give a general impression supported by illustrative statistics rather 
than a comprehensive survey. Subsequent chapters present and discuss the detailed 
statistics of government employment since 1891. First, a general view of the expan- 
sion of government (as measured by labor employed) is provided. The rapid growth 
in the volume of employment provided by government that here becomes apparent 
provides the background for a more detailed breakdown of the data, and also invites 
speculation about the causes of the growth. Next, successive chapters treat labor em- 
ployment by the central government, local authorities and nationalized industries 
separately. The statistical information on central and local government covers the 
whole of the relevant period comprehensively and in convenient classifications. As a 
result of the limitations of the available data and of the history of the industries 
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concerned, the chapter on nationalized industries is inevitably less comprehensive. 
Adequate statistics are provided for their employment of labor in 1950, but a satisfac- 
tory historical perspective would call for comparable data for the pre-nationalization 
era, when some of the industries were privately owned and others belonged elsewhere 
in the public sector. However, some notion of the historical evolution can be obtained 
from the statistics of pre-war government employment in trading services provided 
in other chapters, and from individual comparative statistics and descriptive ma- 
terials provided in the text. A final chapter presents a comparison of the relative 
growth of government employment in Great Britain and the United States, and an 
appendix describes the sources used and the methods of compiling the statistical 
tables in gratifying detail. 

This volume makes a useful contribution to our knowledge of the economic ac- 
tivities of the British government. It also encourages speculation about the nature of 
the causal influences on the growth of government activity by directing attention to 
the characteristic features of that growth. For example, both America and Britain 
have seen considerable economic growth since 1900. In both, government economic 
activity (at least as measured by labor employed) has been the most rapidly expand- 
ing sector of the economy. Is this similarity due to general causes, operating despite 
the admitted differences between the two economies? The authors believe that it is, 
and present stimulating hypotheses on this and other matters, that would seem to 
warrant further investigation. Although the authors do not dwell on this, their ma- 
terial also makes clear (particularly when considered alongside Fabricant’s study) 
the limited value of formal economic analysis in understanding or explaining the 
economic activities of governments. There is room both for a new intellectual ap- 
proach and for further work of this kind on the facts. 


World Commerce and Governments: Trends and Outlook. W. S. Woytinsky and E. S. 
Woytinsky. New York: The Twentieth Century Fund, 1955, Pp. li, 907. $10.00. 


ALEXANDER Eckstein, Harvard University 


NE can but stand in awe of this monumental work of encyclopedic range which 
serves as a sequel to an earlier volume on World Population and Production. In 
effect, these two volumes together represent in a way a combination of a United 
Nations Statistical Yearbook with a certain amount of historical depth and a popu- 
larized or diluted version of an Encyclopedia of the Social Sciences. However, in 
another sense this comprehensive survey may have a twofold origin derived from 
Woytinsky’s earlier work Die Welt in Zahlen, and the Twentieth Century Fund study 
on America’s Needs and Resources. In any case, it is a work of a breadth and scope 
that requires tremendous erudition, judgment, a good empirical feel, and stick-to- 
itiveness—all qualities with which the Woytinskys are unusually well-endowed. 
The central theme of both volumes as stated by the authors themselves is “to 
study world trends during the fateful era in which the mechanized economy orig- 
inated on the two coasts of the North Atlantic is becoming the universal civilization 
of mankind.” This is a comprehensively documented record of the expansion of the 
world economy under the impact of industrialization. The principal carrier of this 
expansion was Europe in the nineteenth century—particularly Britian—and the 
mantle was largely inherited by the United States in recent decades. In many ways, 
foreign trade and colonialism, sometimes separately and at other times in combina- 
tion, served as the highway over which new influences, new ideas, and new technology 
traveled. 
This general theme is developed in three parts: Part I deals with trade, Part II 
with transportation, and Part III with governments. The section on trade is focused 
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almost exclusively on foreign trade. It starts with an introductory chapter on Trade 
in the World Economy, which surveys very briefly the evolution of trade, both in- 
ternal and external, through economic history, and provides some data on the place 
of trade in national economies. This is followed by chapters on The Value and Volume 
of Foreign Trade, Direction of World Trade, The Composition of Foreign Trade, 
Balances of Payments, and Tariffs, Trade Agreements and Trade Restrictions. 

The picture which emerges from these chapters and the rich tabular material in 
them is a familiar one, which however gains in perspective by virtue of its compre- 
hensive coverage in time and space. It shows a rapid growth in the volume of foreign 
trade, which nevertheless lags behind the rate of growth in industrial production for 
the period 1870 to 1952 as a whole, although for some years preceding World War I 
and during the Great Depression, these trends were reversed. One of the facts that 
emerges particularly strikingly from this survey is the relative displacement of 
Europe by the United States in total world trade. This is particularly marked in 
respect to world exports, Europe’s share declining from about 51 per cent in 1913 to 
38 per cent in 1952 and the United States’ rising from 16 to 27 per cent. However, 
while total world trade and the role of the United States in it were expanding, the 
importance of the foreign trade sector in the domestic economy of the United States 
was shrinking, as is illustrated by the fact that foreign trade in relation to national 
income declined from about 21 per cent in 1880 to only 8 per cent in 1950. 

At the same time, Part I shows the emergence of a world trading network and 
analyzes its characteristics. This part of the survey leans heavily on Folke Hilgerdt’s 
League of Nations studies on Europe’s Trade and The Network of World Trade. The 
authors complete this part of their study by tracing how this world trading network 
was buffeted and transformed under the impact of two world wars, the Great Depres- 
sion, and the growing restrictions on trade, to the point of dissolution of a single world 
trading system. 

In general, Part II follows the pattern sketched in the preceding section of the 
book. It starts with a discussion of the importance of transport in the world economy 
and in different national economies, which is measured in terms of the transport share 
in total employment and in national income. It then proceeds to trace the evolution 
of different modes of transport, expansion in aggregate freight and passenger traffic, 
and shifts in the relative importance of the different modes of transport. It also 
touches lightly upon the economic effects of transport development and the economics 
of the different transport industries. 

Finally, Part III is divided into six chapters, starting with “Nations and Govern- 
ments,” which provides data on the number of independent states, their area and 
population, and their births and deaths. The next chapter on “Elections and Parlia- 
ments” is largely a review of different electoral systems, of voter participation, 
governmental and constitutional forms, etc. The remaining chapters deal with 
“Colonial Empires,” “Government Expenditures and Revenues,” “Public Debt,” 
and “International Cooperation.” 

It is, of course, difficult to summarize or even to indicate the general contents of 
such a vast work within the compass of a brief review. By its very nature this is a 
survey in width rather than an analysis in depth. It is also necessarily uneven. Thus 
foreign trade is much more adequately dealt with than colonial developments or 
public finance. Throughout, there are sections and special topics that are treated 
quite superficially. For instance, all of the outlook discussions at the end of each 
chapter are quite unrewarding, and are based less on systematic and sophisticated 
forecasting than on belabering the obvious. 

In appraising and commenting upon this survey, one ought to perhaps distin- 
guish between the statistical tables and the text. The statistical material should be 
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most valuable for reference purposes to social scientists and laymen alike. The textual 
narrative, however, is likely to prove of comparatively limited value to the specialist, 
though it may prove useful to the layman. Yet, by no means is this intended to sug- 
gest that the statistics used are above reproach. As a consequence of the authors’ 
desire not to overload the tables with long qualifying footnotes and extended method- 
ological discussions, the unwary layman for whom much of this is intended might be 
easily misled by a lack of explanation concerning the quality, character, comparability 
and reliability of statistics in different parts of the world. One would have thought 
that at least a general discussion of this problem should have been incorporated into 
the introduction. This assumes particular importance when the authors use data of 
clearly dubious validity without explicit qualification. As an instance, one might cite 
the rather meaningless population figure of 463.5 million for China, or the grossly 
erroneous national income figure for the USSR given as $59.5 billion for 1949. One 
might also question the meaningfulness of an estimate of world income. Another 
serious shortcoming of this book is its comparative neglect of Asia. The text deals 
almost exclusively with Europe and the U. 8. For example, in tracing colonial expan- 
sion, Japan’s colonial policy and empire are completely omitted. 

There are a number of other estimates, figures, and methods to which one could 
take exception. However, this is perhaps inevitable in a survey of this vast scope. It 
does not reduce one’s admiration for the authors’ breadth, ability, and willingness to 
tackle such a vast work, nor does it seriously impair the usefulness of this book as a 
reference work, particularly if used in combination with the original source references 
given in the valuable bibliography. 


Regional Income, Studies in Income and Wealth, Volume 21, National Bureau of Economic 
Research, Conference on Research in Income and Wealth. Princeton: Princeton Univer- 
sity Press, 1957. Pp. x, 408. $8.00. 


W. W. McPuerson, North Carolina State College 


HIs volume of Studies in Income and Wealth, edited by Frank A. Hanna, contains 

the ten papers, and comments of participants, which were prepared for the Con- 
ference on Research in Income and Wealth held in June, 1955. The subject matter 
includes discussions of problems and present status of concepts, procedures and data 
with respect to estimation and analysis of income geographically oriented on a local, 
state, or regional basis. 

Following the “Introduction” by Frank A. Hanna, the ten papers are: “Con- 
ceptual issues of regional income estimation” by Werner Hochwald, “Problems of 
assessing regional economic progress” by Harvey S. Perloff, “The value of the regional 
approach in economic analysis” by Walter Isard, “The geographic area in regional 
economic research” by Morris B. Ullman and Robert C. Klove, “Analysis of inter- 
state income differentials: theory and practice” by Frank A. Hanna, “Interregional 
differentials in per capita real income change” by Abner Hurwitz and Carlyle P. 
Stallings, “City size and income, 1949” by Edwin Mansfield, “Appraisal of alternative 
methods of estimating local area incomes” by Lorin A. Thompson, “Measurement of 
agricultural income of counties” by John L. Fulmer, and “Development of postcensal 
population estimates for local areas” by Henry 8S. Shryock, Jr. 

For professional workers who are engaged either in the development or use of 
income estimates, the discussions of concepts, procedures, and data used in geographi- 
cally (space) oriented income estimations provide a useful review. It is generally 
agreed that available data are incomplete, inaccurate, and/or improperly classified 
for fully meeting the demands for space oriented income estimates. However, too 
much attention is given to the search for “short-cut” techniques to use as substitutes 
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for adequate and accurate observations of the past, and not enough attention is 
given to the question of how, in the future, the adequacy and accuracy of observa- 
tions might be increased. Also, the book contains little discussion of Regional Income 
oriented to the future. As guides to public policy as well as to many economic deci- 
sions of private firms, the work must go beyond estimates and analysis of past income 
and resource use to explore future production and income possibilities on a geograph- 
ical basis. To do so requires more complete and accurate quantitative and qualitative 
inventories of endowed resources, technical coefficients of production, and prospective 
regional differences in factor supply and product demand. 

Limitations of space will permit only a few specific comments here. Walter Isard’s 
concern about the need for an overhauling of the tool kit for regional analysis (p. 78) 
is a valid one. But much more can be done than yet has been with the present state 
of economic and statistical knowledge, especially if the adequacy and accuracy of 
observations are improved. 

This book has failed to exploit fully economic knowledge in the construction of 
logical models, especially in the analytical work. For example, in the selection of 
independent variables, for meaningful analytical results, more attention should be 
given to those factors that are expected to generate income and differences in resource 
use rather than those items which merely show high correlation coefficients. Ernest 
W. Grove’s skepticism (p. 358) with regard to mechanical formulas of the type pro- 
posed in this book is warranted. These short-cut formulas for use in the estimation 
of agricultural incomes of counties, to the extent that they have a theoretical founda- 
tion, are based on an erroneous interpretation or on an erroneous application of com- 
petitive equilibrium theory. As J. N. Keynes states so succinctly, “If it [the method] 
is purely empirical, then it [the conclusion or the truth] will be established only with 
a more or less high degree of probability, and it cannot be extended far beyond the 
range of space or time which the instances on which it is based were collected. If, 
on the other hand, it is obtained deductively, then it is hypothetical until it has 
been determined how far, and under what conditions, the assumptions on which it 
rests are realized in fact.”! 

The need for data on a county as well as a state basis as cited by Harvey S. Perloff 
(p. 39) is very important, not only because of the changing patterns of economic rela- 
tionships with respect to time, but because effective analysis requires that data be 
classified and aggregated in different ways for different analytical purposes. Much of 
our present data are reported in such aggregative terms that the possibilities and 
effectiveness of analytical work are reduced considerably from what otherwise would 
be possible. 

This book should promote an increased interest and an acceleration in the joint 
efforts of statisticians and economists in the estimation of incomes that can be used 
effectively in any analysis of economic activities in various geographic areas. 


Views On Regulation, Consumer Instalment Credit, Part III, Vol. 1. Geerge D. Bailey. 
Washington, D. C.: U. 8. Printing Office, 1957. Pp. vii, 230. $1.00. Paper. 


JuLtes Backman, New York University 


HIs is one of six volumes included in the Federal Reserve Board’s survey of con- 
‘Ronis installment credit. It contains letters—or digests of letters—received mainly 
from various business groups concerning their attitudes toward the desirability of 
installment credit regulation. Trade associations, commercial banks, sales finance 
companies, manufacturers, and various retail groups were among those surveyed. 





i John Neville Keynes, The Scope and Method of Political E. Fourth editi New York: Kelley and 
Millman, 1955, p. 5. 
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A few comments were also solicited from labor, agriculture, and consumers whose 
comments rated 4 pages out of 230. 

The responses are grouped by the source with a brief introductory paragraph or 
two to show the nature of the group or the relative importance of consumer credit to 
it. The report could have been enriched if these introductory sections were made more 
comprehensive. Approximately 2,000 opinions were solicited and about 800 replies 
were received. It will be no surprise to learn that the respondents—who generally 
would have been affected by the proposed control—were overwhelmingly and vigor- 
ously opposed to new regulations. 

A brief introductory chapter reproduces the letters sent to respondents. Attached 
to each letter was a summary of the background of the study and an outline of the 
arguments for and against regulation. It is not a coincidence that the respondents 
generally emphasized the same arguments outlined in this background report. More- 
over, the summary of views presented in the introduction to the report is substantially 
a reproduction of the outline of arguments contained in the background memorandum. 

The authors of the questionnaire were able to determine in advance the general 
lines of argument—a not very difficult task in light of the voluminous material pre- 
viously available. Under these conditions, serious question may be raised concerning 
the need for this volume. If it was desired to determine the relative weight to be given 
to each argument—and no such evaluation is made in the volume—it would have 
been more logical to present a tabular arrangement of the arguments and to request 
respondents to rank them in terms of their importance. 

While a few of the letters contain some interesting materials (e.g. data showing 
repossessions, delinquencies, ete. (pp. 122-25, 162)), there is a deadly repetition of 
the same views over and over and over and over again. The author says he was asked 
“to canvass the views of the consumer credit industry and of others interested in 
consumer credit with respect to the desirability of installment credit regulation.” 
(p. iii) He performed this task. However, it would have been more useful to obtain 
information concerning their experience with installment credit (e.g. relative im- 
portance in terms of volume of sales, credit experience, changes in terms, etc.) than 
their already known views concerning the lack of desirability of regulation. 


Financing New Car Purchases, Part IV of Consumer Instalment Credit. Board of 
Governors of the Federal Reserve System. Washington, D. C.: U. 8. Government Printing 
Office, 1957. Pp. xii, 151. $0.60. Paper. 


Davin I. Fann, Committee For Economic Development 


ARLY in 1956 the Chairman of the Council of Economic Advisers, on direction of 

the President, asked the Board of Governors of the Federal Reserve System to 
undertake a comprehensive study of consumer installment credit and its bearing on 
the attainment of the broad goals of national economic policy. This report is part of 
the ensuing study: 

In the fall of 1954 recovery from the moderate recession that began the year before 
was underway. A sharp rise in new car sales was important in this recovery and in the 
continued business expansion of 1955. Among the factors that have been credited 
with the expansion in automobile sales are easy credit, liberalization of installment 
terms, the unusual style appeal of the 1955 model, and the increased competition of 
producers and dealers. Installment credit increased by a record $5.5 billion, or nearly 
one-fourth, in 1955. 

Financing New Car Purchases is based on a national survey conducted by National 
Analysts, Incorporated, of Philadelphia under the direction of the Board’s research 
staff. The national survey, conducted in June and July of 1956, included personal 
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interviews with over 4,600 buyers of new cars in 1954 and 1955, including about 3,000 
credit buyers. It also included reports from the lending institutions involved in over 
5,700 credit purchases of new cars, including 2,750 transactions for which the buyers 
also were interviewed. 

The volume includes a summary chapter, six chapters presenting important find- 
ings, 71 tables that provide a wealth of data on almost every conceivable aspect of 
automobile financing, and an appendix furnishing a fairly detailed description of the 
survey—the sampling plan, the interviews, the lender reports and the types of errors 
involved. 

Summarized below are some of the more interesting findings and conclusions: 

1. From the first half of 1954 to the second half of 1955, maturities of 30 months or 
more rose from one-third to two-thirds of all installment purchases; effective down- 
payments of less than 25% of the effective price rose from less than one-third to al- 
most one-half of all installment contracts. Terms were liberalized in all regions of the 
country, by all types of lenders, and to buyers in all income and net worth groups. 
From 1954 to 1955 credit purchases rose almost one-half, while cash purchases rose 
less than one-sixth. Credit buyers in 1955 outnumbered cash buyers 2-to-1, and in 
middle and lower income groups by 3-to-1. The report concludes that liberalization 
of installment credit terms stimulated new car purchases. 

2. In the 1954-1955 experience, relatively small down-payments were commonly 
associated with long maturities. Buyers with lower incomes or in weaker net worth 
positions frequently made smaller down-payments and obtained longer maturities. 
This is at variance with the common supposition that longer maturities usually go to 
buyers with large equities or to more creditworthy buyers. 

3. The proportion of credit buyers who bought new cars before they completed 
payments on their trade-ins rose from one-seventh in 1954 to one-fifth in 1955. Both 
credit and cash buyers who purchased their trade-ins as new cars came to the market 
for replacements much sooner in 1955 than in 1954; the median interval declined 
from 42 to 38 months for cash buyers and from 37 to 32 months for credit buyers. 
Car owners shifted from used to new cars in both 1954 and 1955; one-third of the new 
ear buyers who traded in or sold a car to buy a new one had purchased the trade-in 
as a used car. 

4. The median monthly payment on new car installment contracts remained con- 
stant at about $72 during these two years; the monthly payment in relation to income 
actually declined a little. Lengthening of maturities enabled consumers to purchase 
more expensive cars and to incur bigger debts without devoting more current income 
to installment debt repayment. 

5. The median ratio of monthly payment to disposable income for all new car 
credit buyers in 1955 was 15%. Among the 30% of credit buyers who had disposable 
incomes under $5,000, however, 60% allocated more than 20% of their disposable in- 
come to new car debt repayment; with 40% paying more than 25% of their current 
income. 

6. The upswing in new car purchases was general. All income groups, all regions, 
and most city-size groups increased their purchases, the prices they paid, and the 
frequency with which they bought on credit. Trading-up to more expensive models 
was more marked in lower and upper income groups than in the middle income 
groups. 

7. Higher income buyers bought for cash more frequently than lower income 
buyers, and when they used credit, made larger down-payments and negotiated 
shorter maturities. Almost all credit buyers used a single source of credit; the use of 
supplementary loans for the down-payment was infrequent. 
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8. The rise in both credit and cash purchases and the degree of trading-up to more 
expensive models were most marked in the west. A higher proportion of buyers in the 
west used credit, and maturities of 30 months or longer were more frequent in the 
west. Maturities of 36 months, however, were more common in the northeast. Down- 
payments were typically lowest in the west and south. 

9. There is very little evidence that, for the country as a whole, any one type of 
lender consistently led in liberalizing auto credit terms. In some localities, the first 
to liberalize was a major sales finance company; in others, a small regional or local 
sales finance company; in still others, a bank. 

10. Effective annual finance rates on most new car contracts were between 9% 
and 13%. The median rate of 11% in 1955 was only slightly higher than that of 1954. 
New automobile loans by banks directly to consumers bore rates below those on 
paper purchased by banks from dealers and other lenders. Installment contracts 
purchased by banks carried lower rates on the average than paper purchased by sales 
finance companies. 

11. As a result of liberalization of installment credit terms in 1954-1955 the 
equities of buyers on their new cars were reduced, the periods over which their 
equities would remain small were extended, and the periods during which adverse 
developments in employment or in automobile markets could affect repayments 
were prolonged. Financial institutions were thus exposed to greater risk. Actually 
no serious problems have developed so far. By the time of the interview in mid-1956, 
only 1-in-20 of the 1954-1955 new car installment buyers had refinanced his con- 
tract; only 1-in-50 had lost his car through repossession. 

12. Approximately one-half of the installment buyers in 1954-1955 said they would 
not have bought a new var at the time they did if substantially larger down-pay- 
ments or shorter maturities had been required. 

13. Finally, installment credit was used on a wider scale by middle and upper in- 
come buyers. By 1955 more than half of new car buyers at all income levels up to 
$15,000 used credit financing, and those with incomes of $5,000 or more accounted 
for four-fifths of the total increase in new car credit purchases. 

The very excellence of the report brings into sharp focus some of the basic limita- 
tions of the survey approach for problems in economic policy. Consider what is 
undoubtedly the basic conclusion of the report, that “the countrywide availability of 
installment credit on easier terms stimulated new car purchases.” This conclusion is 
based on (1) the fact that credit purchases increased almost one-half while cash 
purchases rose less than a sixth, and (2) that over half of the installment buyers in 
1954-1955 stated that they would not have bought the new car at the time they did 
if larger down-payments or shorter maturities had been required. Whether this 
information substantiates the conclusion may be questioned. 

First, is it true that increased installment credit on automobiles has its main or 
sole effect on automobile purchases? Does the form in which credit is available deter- 
mine the way in which the loan is spent—so that increased mortagage lending, for 
example, affects housing expenditures primarily and increased installment credit 
affects durable goods expenditures primarily? 

From the first quarter of 1954 to the fourth quarter of 1955, total consumption 
expenditures rose by $27.8 billions, while durable goods expenditures rose $6.9 
billions. Although the percentage rise was most pronounced in durable goods ex- 
penditures, durables account for only 25% of the dollar increase. How then do we 
know that the increase in installment credit over this period from $23.4 billions to 
$28.4 billions went mainly to finance the increase in durable expenditures? 

An ordinary household without marketable collateral cannot readily borrow from 
most financial institutions, except personal finance companies, even though this 
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household has fairly good income prospects and positive net worth. It is very easy, 
however, for this household to borrow against the security of a durable. Is it, there- 
fore, not reasonable to expect that a given household which intends to spend more 
than its income will borrow against the durables? And, if this is so, will it not always 
appear that liberal terms have had a greater impact on auto sales or housing starts 
than they actually had? 

Second, does the simultaneous increase in new car sales and installment credit 
outstanding prove that easier terms were mainly responsible for the increase in car 
sales? The proportion of new cars that is bought on credit varies considerably; indeed, 
over the last ten years the proportion of credit sales has varied from 42% to 68%. 
Moreover, the proportion of credit sales varies directly with total car sales so that the 
higher the sales, the greater tends to be the proportion of credit sales. In part, this 
positive correlation between the proportion sold on credit and total sales may reflect 
a positive correlation between easier terms and total sales; but it is also true that an 
increase in the demand for new cars—say, because income has risen or the new model 
is very popular—results in a higher proportion sold on credit ¢ ven at existing install- 
ment terms. The conclusion that easier terms stimulated new car purchases because 
credit sales increased considerably more than cash sales may overstate the influence 
of the easing of terms. 

Third, do the replies of respondents demonstrate that easier installment credit 
terms were either necessary or sufficient to account for the increase in new car sales? 

The report states that “approximately one-half of the installment buyers in 1954- 
1955 said they they would not have bought the new car they did at the time if sub- 
stantially larger down-payments or shorter maturities had been required”; this fact 
is used to buttress the conclusion that easier installment terms stimulated new car 
purchases. The evidence on the probable effects of tightened terms is based on the 
following two questions: “Suppose you had to make a substantially bigger cash 
down-payment” and “Suppose the monthly payments had been substantially more— 
would you have bought the car?” In reply to these questions, 49% said they would 
not have bought a car if bigger down-payments were required, and 59% would not 
have bought a car if larger monthly payments were required. 

The conclusion given in the report on the basis of this evidence is somewhat 
doubtful. First, accepting the findings at face value, it would appear that at least 60% 
of the credit buyers would not have bought a car since shorter maturities alone would 
have discouraged 59%. Second, even if we accept the 50% figure for the number of 
credit buyers who would not have bought without easier terms, the resultant total 
car sales are too low to be believable; for this implies car sales in 1955 (to other than 
businesses and governments, the universe covered by this report) of 4.520 million as 
opposed to 4.990 million in 1954. Third, it is not safe to place too much weight on 
these questions since they were ambiguous in that they did not specify alternative 
terms—were installment terms to be as restrictive as 1952-1953 or stricter. 

The evidence presented in the report would not convince a skeptic that easier in- 
stallment credit terms increased car sales, although this conclusion is not unreason- 
able. It is quite possible that with the same monetary policy—the same total growth 
of the assets of the banking system—but with loans not restricted to collateral arising 
from specific purchases such as autos or other durables, the same number of car 
sales would have been generated without easier installment terms. It is also possible 
that even without the expansion of the banking system the same number of cars 
would have been sold because the basic factors influencing the spurt of sales in 1955 
lay outside the financial sphere. According to this view, the growth of sales in 1955 
was due mainly to the new models introduced in 1955 and to the rise in income, and 
in the absence of these factors the easing in terms would have had little impact. 
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Spending of Middle-Income Families. Emily H. Huntington. Berkeley and Los Angeles: 
University of California Press, 1957. Pp. viii, 179. $3.00. Paper. 


Marearet G. Rerp, University of Chicago 


Y ire is another of the interesting publications of the Heller Committee for Research 
in Social Economics presenting data on income and consumption of nonfarm 
households in the area of San Francisco. An earlier study by E. H. Huntington and 
M. G. Luck, Living on a Moderate Income, 1937, related to families of street-car 
men and clerks. This one is concerned with salaried workers above routine clerical 
workers and below the executive class. Thus it relates to households excluded from 
the early large scale surveys of the U. 8S. Bureau of Labor Statistics which were con- 
fined to wage-earners and lower-salaried clerical workers. 

Various eligibility criteria as to household type and income narrowed the eco- 
nomic groups surveyed. These eligibility criteria greatly increased the difficulty of 
getting a random sample at low cost—a necessary condition for the survey. Various 
methods of sampling were used. These yielded data from 159 households for 1950. 

Income and expenditure patterns of these households are described in some detail, 
more space being given to medical and dental care and to clothing than to any of the 
other main categories. For both of these, data are shown for individuals as well as 
households. Since the set is small, only limited investigation of factors causing varia- 
tion in consumption was feasible. Some data by two main groupings are shown, 
namely by occupation and by relative income level, i.e., those with income under 
$6,000 and with incomes of $6,000 or more. Even for these rankings only limited data 
are shown. For example, for the occupational grouping household size is reported 
permitting some judgment of possible conditions other than income influencing 
food expenciture; but facts as to home ownership are not given, a very important 
condition affecting expenditures for housing. For the grouping by relative incomes 
expenditures are shown for only some of the main consumption items. Expenditure 
data by tenure are shown but income by tenure is not reported. 

An interesting comparison is made with a random sample of all households in the 
1950 survey of the U. 8. Bureau of Labor Statistics (p. 22) and also for installment 
buying between the data collected and those reported in the Consumer Finance 
Survey. (p. 164) 

The survey provides one type of information commonly omitted in current surveys 
of income and consumption, namely income in kind. If for no other reason this study 
is likely to be widely quoted. Income in kind for this group was important. Mean dis- 
posable income was $6,076 (pp. 13 and 14) and mean nonmoney income $751. (p. 37) 
Of this $528 was net occupancy value of owner-occupied dwellings. 


Migration Probabilities. Gunnar Kulldorff. (Lund Studies in Geography Ser. B. Human 
Geography No. 14, Departments of Geography and Statistics, The Royal University of 
Lund, Sweden) Lund: C. W. K. Gleerup, 1955. Pp. 45. 5:— Sw. Kr. Paper. 


Leo A. GoopMAN, University of Chicago 


FFICIAL statistics on migration often give information only about movements 

between administrative regions, while movements within regions are not re- 
corded. However, for some purposes, information concerning total migration from 
and within a region is needed. This monograph, Migration Probabilities, presents a 
careful investigation and clear exposition of the problem of estimating total migration 
for a given region when data about migration within the region are unavailable. 
The author attempts to deal with this problem by deducing, under some idealized 
conditions for certain kinds of situations, what he has called the “migration prob- 
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ability.” This is defined for a given region as the probability that a movement starting 
inside the region will cross the border and will end outside the region; i.e., the prob- 
ability that a movement will be recorded. If this probability is known, the recorded 
mobility data can be adjusted to obtain an estimate of the actual number of move- 
ments. In order to determine this probability, a mathematical model is contructed 
that takes into consideration the size and shape of the community, the distribution 
of the starting points and the distribution of the length and the direction of the move- 
ments. The migratioa probability is then derived as an explicit function of the param- 
eters characterizing this model, and tables of numerical values of the migration 
probability are provided for certain values of the parameters. To illustrate the use of 
these mathematically derived probabilities, some data concerning actual movements 
from and within a particular community (Asby, Sweden) are studied. A discussion of 
the consistency of the assumptions of the mathematical model with the observed data 
from this community is presented. Also, the numerical estimates of the migration 
probabilities, which are determined by the use of the community data to estimate 
the parameters related to the model, are compared with the actual migration ratios 
observed. The author mentions that further numerical studies are necessary before 
any general conclusions can be drawn. 

It might be worthwhile to study, in some detail, situations in which the mobility 
data available for a community are less complete than for the community that the 
author has studied intensively, and in which the estimation of the relevant param- 
eters is consequently more difficult (e.g., where the actual distribution of length of 
movements is not available for all movements, but only for certain kinds of move- 
ments). It might also be interesting to use the point of view and some of the particular 
results that the author has developed in order to carry out a comparative study of 
different communities, in the situation where the mobility data available for the 
communities were sufficiently complete so that the relevant parameters in the au- 
thor’s model could be estimated for each community. In some comparative studies, 
certain simplifying assumptions might be made, rendering unnecessary such complete 
data for carrying out the study. 

The results presented in this monograph are based on a mathematical model es- 
pecially appropriate for rural communities that are located in a larger area of similar 
character without dominating towns, and that have a uniform distribution of farms. 
It would be useful to generalize this model so that it would be appropriate also for 
other kinds of communities, and to see how results based on the generalized model 
might differ from the special results obtained by the author. 

This monograph is a good example of the application of clear thinking involving 
the development of mathematical models, the derivation of results based on these 
models, and the use of statistical methods in comparing various aspects of the models 
with relevant empirical data, in order to study a particular problem arising in the 
social sciences. 


The Economic Status of the Aged. Peter O. Steiner and Robert Dorfman. Berkeley, Cali- 
fornia: University of California Press, 1957. Pp. xx, 296. $5.00. Ageing in Industry. F. Le 
Gros Clark and Agnes C. Dunne. New York: The Philosophical Library, Inc., 1956. Pp. 
x, 146. $7.50. 


Junius A. Jann, Community Service Society of New York 


HESE two volumes will be of special interest to persons concerned with the applica- 

tion of statistical methods to study employment and associated health and welfare 
problems of aged or aging persons. Each provides considerable statistical information 
that is not otherwise available. In addition, relatively sophisticated statistical methods 
are used to define basic problems and to derive or test relevant hypotheses. 
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The Economic Status of the Aged is a report of the results of an area-sample survey 
of persons aged 65 years or over, in the population of the United States as of April 
1952, excluding persons residing in military reservations, penal or mental institu- 
tions, and homes for the aged, infirm, or needy. The survey was one of a series of 
studies of problems of aging by the Institute of Industrial Relations of the University 
of California. The sampling and interviewing phases were carried out by staff of the 
United States Bureau of Census, and included about 3,000 households with 3,600 
persons aged 65 or over. In addition to the population and labor force characteristics 
regularly collected by the Census Bureau, detailed information was obtained on 
sources and amounts of income, expenditures, and assets. 

In analyzing information, three major strata were used: married couples, unre- 
lated males, and unrelated females. Estimates of the numbers and percentages or 
averages are presented for the couples or individuals in each of these strata, classified 
according to each of the characteristics studied, and for cross-tabulations of selected 
characteristics related to employment status or to income. The presentation of these 
results is carried out with exceptional clarity of definitions and procedures. 

In the interpretations, however, two types of statements are used frequently in 
this report which may have some intuitive appeal, but which have little statistical 
merit. One is the use of results of statistical analyses of the relationships shown by 
cross-tabulations to answer questions which involve the notion of “causation.” For 
example, such as: “Why are they not in the labor force?”, and “What determines the 
size of an older person’s income?” (p. 5) Although it is possible to translate such 
questions into forms that can be answered by statistically derived or testable hypoth- 
eses, this has not been done in this study. The derivation or testing of such hypoth- 
eses by rigorous statistical methods would require the use of experimental designs 
rather than a sample survey such as was used in this study. The other type of ubjec- 
tionable statement involves the interpretation of results of statistical tests of hypoth- 
eses in terms such as: “The chi-squared test ... leads to a calculated chi-squared 
of 15.8 which, with 11 degrees of freedom, is readily attributed to chance fluctua- 
tions” (p. 46). If such statistical tests are to be applied at all, the hypotheses to be 
tested and the rules for rejection or acceptance should be more completely and clearly 
defined prior to examination of the sample data than they apparently were in this study. 

These criticisms do not detract from the main results of this report in providing 
more adequate statistical information on problems of aged persons, and some in- 
teresting hypotheses which can be made the basis for further statistical and experi- 
mental research on this problem. 

Ageing in Industry presents the results of statistical analyses of Census Reports on 
occupations, age, and mortality in England, Wales, and Scotland for the years 1921, 
1931, and 1951. Three types of statistical analysis were employed: The first involved 
computations to estimate the numbers of persons in each of 32 occupational classes 
and four age groups (35-44, 45-54, 55-64, and 65 or over), as of 1921 or 1931, who 
were in the same occupational class, retired, died, or who transferred to some other 
occupations by the time of the following Census year. The Census sources used did not 
provide for cross-tabulations between Census years, so that these estimates could 
not be computed directly as by the familiar “Cohort-Survival” method. Instead, some 
ingenious approximation procedures were applied, which are referred to by the 
authors by the term “Moving Cohorts.” These are well described and illustrated by 
the authors, but it is unfortunate that they did not sufficiently emphasize and eval- 
uate the use of these procedures as approximations to the “Cohort-Survival” method. 

The second type of anaiysis involved computations of ratios of the number of per- 
sons in each of the occupational classes who were 65 years or over to the number who 
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were 35-64, for each Census Year. These ratios were compared to corresponding 
“control” ratios of the number of persons 55-64 to the number 35-64. It is not made 
clear why these particular ratios were chosen rather than other apparently more com- 
parable ratios: for example, the comparison of the proportion of the total number of 
persons in an occupational class who were 65 or over, to the corresponding propor- 
tions who were 55-64; or, the ratios of the number of persons 65 or over to the number 
of persons 55-64 compared to the ratios of the number of persons 55-64 to the num- 
ber of persons 33-54 years of age. 

The third type of analysis involved computations of percentage increase or de- 
crease in the numbers of persons employed, between the years 1931 and 1951, for 
each of the 32 occupational classes and for each of the four age groups previously 
used, as well as for the remaining group under 35 years of age. 

The results of these statistical analyses are presented for each of the 32 occupa- 
tional classes, together with supplementary descriptive information on concurrent 
changes in conditions of production and employment in the occupations. These re- 
sults provide for a comprehensive but concise description of trends in the occupational 
status of persons in each of the age groups and occupational classes studied. The re- 
sults show trends that vary greatly between different occupations and age groups, 
and, also, that there are certain occupations for which the proportions of aged persons 
in the labor force has remained relatively high or has been increasing. 

In a concluding chapter, the authors attempt to use their results to answer a ques- 
tion that they have stated as follows: “What proportion of the men who reach their 
mid-sixties in a given job are physically capable of remaining on the same job into 
their late sixties, or even in some cases beyond?” (p. 134) Although the question as 
stated has apparent interest, it is unfortunately one which has not, and cannot be, 
statistically answered. The authors have provided answers in the form of numerical 
values for the upper and lower limits of “survival rates” for persons employed in each 
of the 32 occupational classes. Such rates even if clearly defined and correctly es- 
timated are not acceptable as measures of the “physical capability” of employed 
persons. Even after much careful study, this reader has been unable to find a defini- 
tion of the conditions and methods by which these “survival rates” were defined or 
estimated. Apparently they were not clearly defined, and were arrived at by intuitive 
judgments of the authors rather than by verifiable statistical procedures. Further- 
more, since the data used would at best only provide for approximations, with un- 
known amounts of error, of such “survival rates”, the results of the analysis might 
have been used to determine a rank ordering of the occupational classes according to 
their estimated rates. 

Instead of attempting to give such questionable answers to a statistically un- 
answerable question, the authors preferably could have limited their conclusions to 
generalizations and projections of the observed differences and trends in the em- 
ployment status of aging persons. 


American Families: A Demographic Analysis of Census Data on American Families at 
Mid-Century. Paul C. Glick, for the Social Science Research Council. New York: John 
Wiley & Sons, Inc., 1957. Pp. xiv, 232. $6.00. 


Joe. Smitu, Michigan State University 


ERIOUS students of the American family are unlikely to find a more comprehensive 
S or useful collection of demographic information on the subject than that con- 
tained in Glick’s monograph. The plans of the Social Science Research Council and 
the Bureau of the Census to make the treasures of census information more widely 
available by supporting the writing of a set of volumes organized around a series of 
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meaningful topics are justified by this effort. The brevity of the book belies the careful 
attention lavished on it by a man who is both involved with his special subject and 
experienced in the use of Census data. It also hides a breadth of coverage which in- 
cludes discussions of family living arrangements, household and family composition, 
the life cycle of the family, changes in family composition and in social and economic 
position during the family life cycle, first marriages and remarriages, marriage rates 
and the number of previous marriages, family dissolution through death, divorce 
and separation, and future household formation. 

Despite the wealth of materials treated, there are numerous questions which one 
might legitimately expect to be answered by an examination of the Census, but which 
must be given only brief mention because the necessary data are unavailable. Among 
such questions are those of trends in the average size of families (these must be es- 
timated from average size of household, p. 21), the impact of the last war on family 
size, accurate estimates of the years at which various important events in the family 
life cycle oceur, differences between the daytime and nighttime composition of the 
family, and differences in the average age of marriage of whites and nonwhites. 
Another manifestation of weakness in these data is the relative lack of depth in 
trend analyses. When addressing questions of trend, Glick is usually limited to only 
a few decennial Censuses for his observations and must supplement these with data 
from the Current Population Survey. The problem stems from instability and result- 
ing noncomparability in the Census Bureau’s use of family concepts in its work. 

While much of the weakness of Census data for answering questions about demo- 
graphic characteristics of the family is probably attributable to the lack of agreement 
among sociologists and anthropologists on a definition for “family” that can provide 
a basis for statistical record keeping, it must be concluded that the Bureau of the 
Census shares some responsibility for the situation. This should be evident when it is 
realized that information about members of nuclear families who do not reside in 
the same household is not collected in one place at the time of census taking. Prob- 
ably as a consequence of this circumstance, American Families begins with an un- 
avoidable extremely dull discussion of types of families, types of households, and 
relationships between these, and “family” phenomena must often be tortuously dis- 
cussed later because of the need to rely on household data. 

The caution Glick must exercise because of the restrictions of the data for his 
purposes results in a volume that cannot be recommended as general reading. It is 
written for specialists, and even they, confronted with a mass of data so concisely 
treated, will generally prefer to use the book as a reference work. To this reviewer, 
it seems that Glick might pérhaps have been dissatisfied with the prospect of having 
produced a reference monograph—even one that seems destined to attain a position 
that might best be summarized by the attitude that “if Glick doesn’t have it, there’s 
no Census information on the question”—and responded to this possibility by at- 
tempting to lighten the general dullness of the text. While one such response is a 
summary which redounds to the credit of the volume by presenting all of the major 
findings of the study in a smooth narrative, uninterrupted by tables or charts, the 
other response does not. 

Periodically, Glick lapses into speculations about the causes and/or implications 
of the data being analyzed. Since most of Glick’s statements have an authenticity 
supplied by their derivation from fairly reliable and valid supporting data, such un- 
supported speculations are given deceptive and spurious cogency by their contexts. 
While all of these digressions have not been enumerated, two might be mentioned for 
illustration. 
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In considering a greater increase in average family size of rural than of urban non- 
whites, Glick suggests that it is “reasonable to believe” that this trend results from 
the possibility that the farm te city migration of the period was a family movement 
and that the larger rural nonwhite families tended to remain on the farm. No sup- 
porting evidence for this plausible explanation is offered either directly or by ref- 
erence. At another point, in considering households in which a set of in-laws resides 
with a married couple, Glick speculates about differences in family relationships 
that may depend on whether the in-laws are the wife’s or the husband’s. He concludes 
that “Since the women in an extended family of this type are likely to spend more of 
their time together than the men, the family relationships are no doubt more har- 
monious, as a rule, when a mother and her daughter live together than they are when 
an in-law relationship is involved.” (p. 45) In this case, even the existence of differ- 
ences in family relationships which Glick proposes to explain by a “blood is thicker 
than water” principle, is not demonstrated. In all fairness, it should be emphasized 
that these digressions are infrequent and are mentioned only to sensitize the unwary 
reader to the presence of such speculations in a text composed largely of judicious 
statements. 

For the reader who finds it necessary to go beyond this volume and return to the 
Census, there are two appendixes containing very useful discussions of the quality 
and meaning of family data to be found in the Census. A final appendix summarizes 
the technique used in projecting the numbers of future families and households. All 
scholars wanting demographic materials on the American family will probably find 
themselves indebted to Glick for a long time to come. If American Families does not 
have the information sought by a user, he will at least have to thank Glick for sen- 
sitizing him to the weaknesses in the collection and reporting of family data by the 
Bureau of the Census. 


Psychological Tests and Personnel Decisions. Lee J. Cronbach and Goldine C. Gleser. 
Urbana: University of Illinois, 1957. Pp. xii, 165. $3.50. 


Ray Hyman, Harvard University 


HIs concise book restates the problems of psychological testing in terms of modern 

decision theory. The authors reject classical test theory and its emphasis on 
accuracy of measurement. They explore, instead, the implications of a model that 
emphasizes maximization of expected utility. The results, sometimes surprising and 
sometimes congenial to commonsense, often disagree sharply with the recommenda- 
tions and the folklore of classical test theory. 

The personnel decision, treated by this book, is one where information about an 
individual is used to assign him (by means of a strategy) to the treatment that wil! 
yield an outcome with maximum (expected) payoff. Cronbach and Gleser develop the 
analysis of this process for the situation where the tester seeks to maximize utility 
over a large number of comparable decisions (an institutional as distinct from an 
individual decision). The mathematics—most of which is relegated to seven ap- 
pendixes—assumes a cardinal utility scale, a known joint distribution of test score 
and outcome, and a payoff function of the first degree. 

Given these restrictions, the authors explore the effects of such variables as test 
validity, selection ratio, cost of testing, and criterion variability upon gains in utility. 
Gain in utility, Cronback and Gleser contend, should always be compared with the 
best a priori strategy rather than with a chance baseline. 

The effects of various combinations of parameters are examined separately for 
different kinds of personnel decisions. The book suggests a valuable typology con- 
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sisting of 64 kinds of decisions based on such factors as to whether the decision is 
institutional or individual, quotas are present or absent, rejection is possible or not, 
univariate or multivariate information is used, and the decision is final or tentative. 

The book is restricted, for the most part, to those situations where an institution 
assigns an individual to one treatment with or without quotas. Within this frame- 
work, the implications are worked out separately for selection (where individuals are 
accepted or re‘ected), placement (where individuals within an institution are assigned 
to a treatment on the basis of univariate information), and classification (where in- 
dividuals are assigned to one of several treatments on the basis of multivariate in- 
formation). 

Of special interest (especially for potential practical import), are the findings con- 
cerning the differences between fixed treatments and adaptive treatments (where the 
treatment is tailored to fit the individuals), and between single-stage and multi-stage 
or sequential testing. : 

There are succinct and stimulating discussions of such issues as the evaluation of 
payoffs, the interpretation of validity coefficients, the optimal distribution of effort 
among tests, the problem of individual decisions (the problem of guidance), the in- 
adequacy of discriminant functions for personnel decision, and the practical conse- 
quences of test theory. 

Clearly, the authors hope that their contribution will cause a revolution in current 
testing practice. They believe that “a fundamental revision of test theory will affect 
testing practice in the same manner that a fundamental change in political theory 
affects daily life in the nation that adopts it.” Whether this revision by Cronbach and 
Gleser will produce this desired change remains to be seen, but this reviewer is in full 
sympathy with the authors’ attempt. 

Psychological Tests and Personnel Decisions is an important and badly needed book. 
It brings together important ideas and trends bearing upon testing as well as con- 
tributing original suggestions along with a coherent organization. Although it is 
mathematical and nonempirical, its concepts and starting points are derived with 
the practitioner father than the mathematician in mind. As such, it cannot be ignored 
by anyone who is interested in the application of testing to personnel decisions. 


Techniques of Attitude Scale Construction. Allen I. Edwards. New York: Appleton- 
Century-Crofts, Inc., 1957. Pp. xiii, 256. $4.00. 


GerorceE E. Ferris, General Foods Corporation 


bpm book provides research workers who wish to use attitude scales with almost 
all the fundamental knowledge they require. In addition to summarizing the exist- 
ing methods for quantizing attitudes, certain original uses are outlined. Where previ- 
ously known methods are described, the present version is quite often easier to follow 
than the original. The book is well planned also for didactic purposes with realistic ex- 
amples at the end of each chapter. 

The introduction should be read not only by psychologists and statisticians, but 
by those interested in market and motivation research. The algebra involved in 
Chaps. 2 and 3 (and also later) is clearly set out. The practice of including in each 
chapter an internal check on the model proposed is a very commendable one. Two 
minor criticisms occur to the reviewer: a little more space could be devoted to proba- 
bilistic considerations (e.g. the Bradley-Terry model for paired comparisons ought to 
be cited as a reference in Chap. 3); and where x?-tests are recommended the derivation 
could be given or referred to. Only one misprint was observed (p. 27, in the formula 
a o* is needed in place of ¢) in what was a difficult text to print. The reviewer wishes 
that some illustration of the use of the scale, once established, had been given. 
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Petrographic Modal Analysis—An elementary statistical appraisal. Feliz Chayes. New 
York: John Wiley and Sons, Inc., 1956. Pp. x, 113. $5.50. 


JEROME CorRNFIELD, National Institutes of Health 


ETROGRAPHY is the study of rocks. Modal analysis is concerned with the composi- 

tion of rocks. This slim volume, written by a petrographer for petrographers, is 
that all too rare a phenomenon in many fields of science, a study of methods of 
measurement and of the factors affecting their precision. It should be of interest to 
statisticians because (a) it illustrates the application of statistical ideas and methods 
to a field of science with which many statisticians are unfamiliar and (b) it will sug- 
gest to the attentive reader several apparently new statistical problems. 

The fundamental idea of the book is that an estimate of the proportion which any 
particular mineral component constitutes of the complete rock specimen is provided 
by the proportion of times a randomly placed point falls in that component. Since it 
is not possible in practice to place a point at random in a volume of rocks, the pro- 
cedures actually employed involve the selection of random sections, and the use of 
randomly thrown points to estimate the proportion that the component of interest 
constitutes of the area of the sections. 

This is essentially a Monte Carlo procedure, and it is interesting to observe that the 
same idea has developed independently in a number of different sciences (and all of 
them well before the term itself was invented). Thus, the physiologist, H. W. Chalkley, 
proposed the same method for the same purpose in biology—the estimation of volume 
percentages of morphologic components in fixed preparations. Similarly, the Nobel 
prize winner, P. Debye, has elucidated the structure of solid porous catalysts by treat- 
ing the intensity of scattered X-radiation as a random variable. The anatomist, S. D. 
Wicksell, inferred the frequency distribution of the diameters of three-dimensional 
corpuscles of different sizes from the observed distribution of diameters in a random 
two-dimensional section. In some of these applications the random element is deliber- 
ately introduced by the observer, as in the applications of the volume under review. 
In others, Debye’s for example, it is a consequence of the method of observation. 
Whatever the motivation, all these applications are spiritual descendants of Count 
Buffon’s demonstration that the long run frequency with which a randomly thrown 
needle intersects the lines on a grid is a function of w. If a review existed of the major 
applications of this geometric probabilistic point of view, together with the major 
theorems it has discovered, some of which are quite startling, it would make fascinat- 
ing reading. 

The author addresses himself in his first two chapters to the task of convincing his 
fellow petrographers that his Monte Carlo procedure is a valid one. He then turns to 
a consideration of the reproducibility of results in given sections. The notion is here 
introduced that, if proportion p of a section consists of a given component, then an 
estimate of this proportion based on n independent throws has variance p(1—p)/n. 
Experimental results are cited in confirmation. In practice a cluster of m points is 
thrown so that the magnitude and direction of the error in considering n/m independ- 
ent throws of this cluster as equivalent to n independent throws depends on both the 
configuration of the cluster and of the mineral components. The author cites results 
which suggest to him that the binomial variance may tend to overestimate slightly, 
but does not consider the possibility that it may also underestimate. 

An interesting problem in geometric probability arises in considering variation 
among sections. The author proceeds entirely empirically. He assumes first that the 
section component of variance is inversely proportional to section area and then de- 
termines the constant of proportionality for three different rock specimens by experi- 
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mentally varying the area studied. (His results imply a different assumption from the 
one stated, however—inverse proportionality to a power of area. For the three rock 
specimens considered the power is less than unity. This suggests that adjacent areas 
in a section are more like each other than distant ones.) The constant of proportion- 
ality is assumed to depend on the coarseness of the rock specimen. A method of esti- 
mating coarseness from observations on a section is given and empirical relationships 
between proportionality constant and estimated coarseness are derived. On the basis 
of these relationships figures showing the number of sections of stated area required 
to achieve pre-specified precision are given. No deprecation of the usefulness of these 
results in planning future observations is involved in suggesting that subsequent 
treatment of this problem might benefit from mathematical formulation and analysis. 


Message Diffusion under Uncontrolled Conditions. Judson B. Pearson, Jiri Nehnevasja, 
and Rodney D. Elliott. University of Colorado Studies, Series in Sociology No. 3. Boulder: 
University of Colorado Press, 1957. $2.00. Paper. 


ANATOL Rapoport, University of Michigan 


HE purpose of the study is stated as the determination of “impact” of leaflet 

messages upon the community of Boulder, Colorado. The indices of “impact” 
were taken as 1) the proportion of the population which actually had a leaflet within 
48 hours of the “drop” by an airplane; 2) the proportion which discussed the leaflet; 
3) the proportion which had knowledge of the leaflet drop; 4) the proportion which 
complied with the request on the leaflet. 

The leaflets, of which there were several kinds, contained information about the 
activities and functions of Radio Free Europe and urged the recipients to contribute 
financially toward its operations. The “drop” was preceded by several days of pub- 
licity in the local paper, in the Denver paper, and over the mass media. Forty-eight 
hours after the drop face-to-face interviewers asked the occupants (who were found 
at home) of randomly selected 428 households (about 7% of the total households in 
the community) 26 questions. Of these, 13 pertained to the leaflet and so were not 
asked of those who had not had the leaflet in their possession. Eight of the remaining 
questions pertained to the identification of the respondent as to age, sex, marital 
status, etc. 

The results were as follows. Of the 428 respondents, 24 reported that they had or 
had had the leaflet in their possession. Of these seven could not recall anything of its 
contents, and 12 could recall only that it had something to do with Radio Free 
Europe. Five persons reported the essentials of the contents. Five persons in the total 
sample of respondents reported having made contributions to the Crusade for Free- 
dom (the sponsoring organization) since the leaflet drop. On the basis of this evidence 
the authors state, “Although the respondent sample is very small, one could logically 
conclude that the possessors of the leaflet were not noteworthily impressed by its 
contents.” With regard to compliance “impact” they state, “It is not likely that in 
Boulder the contributions resulting from the Sky-Drop even began to meet the ex- 
penses of the operation.” Knowledge about the drop fared better. After six days of 
spot announcements on radio, TV, stories in the papers, etc., 57% of the respondents 
answered “Yes” to the question of whether they knew about the Sky-Drop. 

The statistical analysis of the data pertains to attempts to uncover relationships 
between possession or nonpossession of the leaflet on the one hand and various other 
circumstances. For example, relating to the sex of the respondent one obtains the 
following table 


Male Female Total 
Leaflet Nonpossessors 82 322 404 
Possessors 6 18 24 
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Chi-square = .309 for 1 degree of freedom. Significant at the .60 level of statistical 
significance. 

With respect to marital status and length of residence in Boulder the level of sig- 
nificance is somewhat better, namely .20, while with respect to the number of chil- 
dren, it is somewhat worse, namely .80. Significance at the level between .01 and .02 
is obtained in relation to past contributions to the Crusade for Freedom. The authors 
are prompt to admit, however, that they do not know “if possession of a leaflet in- 
creased the tendency to report past contributions or whether past contributions in- 
creased the tendency to obtain a leaflet.” One might mention in passing that a report 
of possession of a leaflet might also have increased the tendency to report past con- 
tributions. 

The authors’ attempts to link the findings to a theory of information diffusion 
through a population are not clear. An equation proposed by Dodd appears in the 
introduction, namely, 


Ipp = kKPsAaPpAsCcTL, (1) 


where I,, is called the “expected number of interacts,” & a constant, P, the “tellers 
of the message,” A, their “tellings,” Pz the hearers of the message, Ax their “hear- 
ings,” Ce the “conditions” (“number of leaflets, value systems, etc.’’), JT’ the “dura- 
tion,” and Z~ the inverse distance between the interactors. What is not clear is how 
the formula purporting to describe the “principle of demographic gravitation” (as 
the proposer has called it) applies to the case at hand and how it is to be used if it 
does apply. For example, is equation (1) a theoretical prediction or a definition of 
“interactance”? If the former, how is “interactance” defined or observed? If the latter 
how are the constituent terms of the definition defined, observed, or counted. How 
does one count “values” contained in C¢? Or what is the “distance among the members 
of a population,” presumably designated by L? Distance between two individuals 
can, in principle, be defined, but what is distance among the members of a popula- 
tion? An average, perhaps? Again, what is a “telling” or a “hearing”? An instance of 
a meeting in which the message is mentioned? Or each mention? If several people 
are present, while one mentions the message, how many “hearings” and “tellings” 
have there been? Perhaps these matters are explained in the appropriate literature, 
but there is no indication of the relation between the supposedly existing meanings 
of these terms and the authors’ findings. 

A somewhat clearer attempt to test a “theory” pertains to the comparison between 
“physical” and “social” diffusion. If a person picks up a leaflet, this is physical dif- 
fusion; if he gets it from someone else, this is social diffusion. Of the 24 persons who 
had the leaflet, 12 (or “precisely 50%,” as the authors remark) said they got theirs 
through physical diffusion, and 12 through social. These findings are compared with 
the corresponding percentages in eight towns in the State of Washington, where 
somewhat similar experiments had been conducted previously. The per cent of 
“physical diffusion” in these eight towns was found to range between 39.6% and 
60.4%, the critical ratios being beyond the .05 level of significance in seven cases out 
of eight. It had been hypothesized that physical diffusion should significantly exceed 
social diffusion. As mentioned, the differences were “significant” in seven cases out 
eight. Unfortunately, five of these seven were in the direction hypothesized and 2 
in the opposite direction. The split in Boulder was precisely 50-50. The question of 
the relation of physical to social diffusion remains open. 

It might be hypothesized that in the cases of “social diffusion” the leaflet was given 
to the respondent (a housewife in possibly four cases out of five) by her husband or 
child. This hypothesis is fairly well confirmed by the fact that 10 of the 12 possessors 
(over 80%) reported that the leaflet was given to them by a member of their im- 
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mediate family. So far so good. It is difficult, however, to see the pertinence of the 
term “social diffusion” (except in the formal sense) to these acts. Looking at somewhat 
more relevant events, of the 24 possessors 6 reported that they started conversations 
about the leaflet with a total of 10 persons (an average of 1.6 hearers per teller.) 
Three of the six reported that the conversations were started with friends (average of 
1.5 per respondent, as the authors note) and one reported starting a conversation with 
a neighbor. One is somewhat at a loss what to do with these statistics, but we will 
venture a bold extrapolation below. 

The efforts involved in the study were of considerable magnitude. To estimate the 
concentration of leaflets in various sections of Boulder, one observer was placed on a 
mountain top, one in a tower, two atop buildings, and one rode along with the pilot 
of the Sky-Drop plane. The cross checks among the reports of these observers on 
where the leaflets went plus the reports of mobile observers cruising the town in 
automobiles enabled the investigators to designate areas of heavy, medium, light, and 
negligible leaflet precipitation. A table showing the distribution of the 24 possessors 
of leaflets among the 18 interviewing areas shows a correlation visible to the naked 
eye, from which the authors conclude that “Areas with a high concentration of 
leaflets were more likely to contain individuals who actually possessed a leaflet.” One 
might mention in passing that in this correlation one also obtains a check on the 
veracity of the reports, since the respondents could not be expected to know whether 
they were in a heavy, light, or medium area of concentration. 

Due care was also given to the randomization of the sample, involving the use of 
tables of random numbers, etc. The representativeness of the sample was checked by 
comparing the distribution of professions in it against that in the community as a 
whole. The statistical procedures in the analysis of the data were beyond reproach, 
and the conclusions were well within the confidence limits warranted by the analysis. 

The investigators were not the only ones involved in the effort. The entire com- 
munity was mobilized, indeed the entire nation—not actually in the investigation 
described but in the operation that made it possible—The Freedom Sky-Drop. Spon- 
sored by Crusade for Freedom, the American Legion, and the Civil Air Patrol, the 
operation, according to the authors “entailed the ejection of hundreds of thousands 
of pieces of printed matter from approximately 1000 Civil Air Patrol airplanes over 
some 200 selected cities throughout the country.” Thus the meager absolute return 
in Boulder should not be taken too much to heart. After all if 6 persons “started 
conversations” with an average of 1.6 persons in a sample of 428, we can expect 
that in the 200 cities bombarded (taking the sample as representative of Boulder and 
Boulder as representative of the 200 cities) we can expect fully 6000 persons to have 
said to as many as 10,000 (barring overlaps) at least “Well, didja see the Sky-Drop?” 
The ratio 1.6 is certainly a respectable propagation ratio and is theoretically sufficient 
to unleash a veritable avalanche of public enthusiasm for the Crusade for Freedom. 
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facilities. 

5. ORO occupies several buildings in Chevy Chase, 
Maryland, one of America’s most attractive 
suburbs. Pleasant homes and apartments in all 
price ranges are available. Schools are excellent. 
Downtown Washington, D. C. with its many 
cultura! and recreational advantages is but a 
20-minute drive. 

6. Favorably competitive salaries and benefits, ex- 

‘ tensive educational programs, unexcelled leave 
policy. 


FOR DETAILED INFORMATION, WRITE: 
Dr. L. F. Hanson 


OPERATIONS RESEARCH OFFICE 


(GRO) The Johns Hopkins University 


6935 ARLINGTON ROAD 
BETHESDA 14, MARYLAND 
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BIOMETRICS 


Journal of the Biometric Society 


Vol. 14, No. 1 CONTENTS March 1958 


How to Use Ridit Analysis (Irwin D. J. Bross) ; The Detection of 
Individual Differences in Accident Susceptibility (Robert Fitzpat- 
rick) ; The k-Visit Method of Consumer Testing (George Ferris) ; 
Precision of Estimates of Variance Components (Therese Kelleher, 
H. F. Robinson, and R. E. Comstock) ; Sampling Technique for Esti- 
_ mating the Catch of Sea Fish in India (P. V. Sukhatme, V. G. Panse, 
and K. V. R. Sastry) ; Statistical Methods for Comparison of Growth 
Curves (C. Radhakrishna Rao); A Multivariate Analysis of Co- 
variance (H. Fairfield Smith) ; The Method of Maximum Likelihood 
Applied to the Poisson Binomial Distribution (D. A. Sprott). 


Biometrics is published quarterly. Its objects are to describe and 
exemplify the use of mathematical and statistical methods in biologi- 
cal and related sciences, in a form assimilable by experimenters. 
Members of the American Statistical Association may subscribe 
through the Association at the rate of $4 yearly. The annual non- 
member subscription rate is $7. Inquiries, orders for back issues and 
non-member subscriptions should be addressed to: 


BIOMETRICS 

Department of Statistics 
Virginia Polytechnic Institute 
Blacksburg, Virginia 
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An immediate success! 


Basic Statistics 


A Textbook for the First Course 


Revised and Enlarged Edition 
By GEORGE SIMPSON and FRITZ KAFKA 


Published just last spring, the revised and enlarged edition 
of Simpson and Kafka’s Basic Statistics, together with its ac- 
companying problems and exercises book, is being accorded an 
enthusiastic reception in colleges and universities throughout 
the United States and Canada. Representative current adoptions 
include: Indiana University, University of Nebraska, Lafayette 
College, University of Southern California, Connecticut Col- 
lege, Tulane University, New York University, Bentley School 
of Accounting & Finance, De Paul University, Trinity College, 
University of Toronto, University of Bridgeport, University of 
Tulsa, University of Colorado, Bradley University, University 
of Arkansas, St. Peter’s College, University of Rhode Island. 


Features of the revised edition include: (1) a greatly ex- 
panded treatment of sampling; (2) new material on the normal 
curve; (3) the rewriting of many chapters, including those on 
general orientation, collection and sources of data, and current 
indexes; (4) several new appendixes; (5) new graphs and 
tables. 


Problems and Exercises for Basic Statistics, prepared by the 
authors especially for use with this new edition, contains more 
than enough work to develop or test the student’s grasp of all 
aspects of the material presented in the text. 


Text: 521 pages. Illustrated. Charts and tables. Appendixes. 

Price, $5.90 

Problems book: 92 pages plus 62 sheets of graph paper. Cumu- 

lative reviews. Price, $2.95. (Answer book available.) 
Examination copies supplied on request 


W. W. NORTON & COMPANY, Inc. 
55 Fifth Avenue New York 3, N.Y. 
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FUNDAMENTAL STATISTICS 


For Business and Economics 


by JOHN NETER, University of Minnesota 
and WILLIAM WASSERMAN, Syracuse University 


Here is one of the most widely accepted statistics texts specifically 
designed for students of business and economics. 


This text is non-mathematical and stresses the concepts under- 
lying statistical methods and their application as a means of mak- 
ing statistics a tool in decision making. 

FUNDAMENTAL STATISTICS FOR BUSINESS AND 
ECONOMICS includes many teachers’ aids; problems and ques- 
tions, footnotes, bibliographies, chapter summaries, appendic »s 
with tables, and calculational methods. 


for your examination copy, write to 


ALLYN and BACON COLLEGE DIVISION 
41 Mt. Vernon St. Boston 8, Mass. 














A DICTIONARY 
OF 
STATISTICAL TERMS 


By M. G. KENDALL and W. R. BUCKLAND 


_ new and extensive dictionary, prepared with the advice and 
assistance of many leading statisticians throughout the world, de- 
fines and explains statistical terms in current use, including those of 
a semi-mathematical character commonly used in the exposition of 
statistical theory. 

Equivalent terms with reference to the basic English text have been 
added in French, German, Italian and Spanish. 


493 pages $4.50 


STECHERT-HAFNER, INc. 


FOUNDED IN NEW YORK 1872 


b> 3 World A oLleading International Booksellers 


31 EAST 10th STREET, NEW YORK 3, N. Y. 
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The IDEA Factors 
Behind MARKET RESEARCH 


TABULATING 


Preliminary 
Planning 


STATISTICAL service begins long before 
a button is pushed. 


You get preliminary assistance in re- 
solving your ideas . . . in translating 
sound thinking into well-planned 
questionnaires for the most practical 
and economical processing. 


There is always a best way to handle 
any assignment and STATISTICAL can 





STATISTICAL 


TABULATING CORPORATION 


Established 1933 - Michael R. Notaro, President 


TABULATING - 


Practical 
Procedure 





Economical 
Processing 





help you apply it through long experi- 
ence in methods and procedures. 


The same careful approach is used in 
processing data to assure highest qual- 
ity in market information. Strict con- 
trols are maintained every step of the 
way from editing and coding to 
finished report. 


And this professional service is avail- 
able to you days, nights, week-ends— 
any time you need it. 


Just call our 
nearest office 


CALCULATING + TYPING 


TEMPORARY OFFICE PERSONNEL 


CHICAGO, 53 W. Jackson, HArrison 7-4500 + 





NEW YORK, 30 Broad St., WHitehall 3-8383 


ST. LOUIS, 411 N. Tenth St., MAin 1-7777 +> NEWARK, National Newark Bldg., MArket 3-7636 
CLEVELAND, 1375 Euclid Ave., SUperior 1-8101 » LOS ANGELES, 541 S. Spring St., MUtual 8201 
KANSAS CITY, 20 W. 9th St., GRand 1-3622 
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ween renew eens 


Mathematicians 


MS to PhD 
OPERATIONS ANALYSIS 


Vitro Laboratories in West Orange, New Jersey is 
presently expanding its Analysis Group. There are 
staff openings for Operations Research Analysts 
with masters or doctorates in Mathematics or 
Physics and 3 or more years experience. 


The work is primarily the analysis and evaluation 
of weapons systems. It is desirable that candidates 
have not only intensive mathematics training, par- 
ticularly in statistics and probability, but also be 
familiar with at least one of the following: radio 
communications, information theory or infra-red 
techniques. The analysis to be employed will re- 


quire a knowledge of the methods of linear pro- 
gramming, servomechanisms theory and model 
construction. 


These openings require men with vision and initia- 
tive. Our modern laboratory provides a profes- 
sional working atmosphere and the location in a 
quiet suburban area provides pleasant living and 
working with easy access to the cultural and edu- 
cational facilities of New York City. 


All inquiries in confidence. Please send resume in- 
cluding salary desired to A. A. Franklin. 


WTO 
LABORATORIES 


DIVISION OF VITRO CORPORATION OF AMERICA 
200 Pleasant Valley Way, West Orange, New Jersey 


(Other laboratories located at Eglin AF Base, Fla. & Silver Spring, Md.) 
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SPRING PUBLICATIONS: 


¢ QUALITY CONTROL AND INDUSTRIAL STA7ISTICS 

Revised Edition, by ACHESON J. DUNCAN, The Johns Hopkins 
University 

This basic text for courses in Statistical Control, Industrial Statistics, 
or Elementary Statistics for Engineers has been revised and brought 
up to date. Its outstanding feature is its explanation of the theory 
underlying statistical quality control. It not only describes procedures 
but also develops the theoretical basis upon which they rest. 


¢ WORKBOOK IN BUSINESS STATISTICS 

Fourth Edition, by LOUIS F. HAMPEL, United Air Lines, Inc. 
This popular and widely used workbook may be used with any stand- 
ard text in business statistics. In the new Fourth Edition up-to-date 
material has been used and new problems are introduced. Students 
are provided the opportunity to compile, analyze, and interpret sta- 
tistical data and to present the results of statistical analysis effec- 
tively. 


RICHARD D. IRWIN, INC. © HOMEWOOD, ILLINOIS 
Publishers of the widely adopted 
¢ BUSINESS AND ECONOMIC STATISTICS 


By WILLIAM SPURR, LESTER S. KELLOGG, and JOHN H. 
SMITH 











The complete introduction to 


Principles of STATISTICAL ANALYSIS 


SAMUEL B. RicHMonp, Columbia University 


An eminently teachable textbook minimum by introducing and explain- 





THE RONALD PRESS COMPANY « 15 E. 26th St., New York 10 
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which presents a thorough, well-illus- 


trated treatment of the collection, anal- 


ysis, and presentation of statistical 
data. Organized around the modern 
concept of statistical induction, book 
keeps mathematical procedures to a 


ing the techniques employed at the 
point of use. A unique feature is a Glos- 
sary of Equations in which each basic 
equation used in the text is listed, lo- 
cated, and briefly explained. 

210 ills., tables; 491 pp. $6.50 


For clearer visualization of facts and figures. . . 


Handbook of GRAPHIC PRESENTATION 


Catvin F. Scump, University of Washington 


A working guide for all concerned with 
producing meaningful, interest-rousing 
charts and graphs. Handbook shows 
how complicated data can be put into 
easily intelligible form; fully analyzes 
each basic type of chart indicating its 
advantages and disadvantages in pre- 


senting information of different kinds. 
Includes the first detailed discussion of 
3-dimensionals. “.. . a highly creditable 
job of producing a reliable and helpful 
handbook.”—Journal of the Ameri- 
can Statistical Association. 

210 ills., tables; 316 pp. $6.00 
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Just Published! 


MODERN BUSINESS STATISTICS 
by Freund and Williams 
Se pianos as 


Modern Business Statistics re ts a skillful 
ing of the traditional matter of de- 
statistics with the newer ideas of statis- 
inference. Without neglecting those tradi- 
tional methods which continue to be used in busi- 
ness. The authors: John E. Freund, of Arizona 
State College, and Frank J. Williams, Universi 
of Tennessee, introduce i , the basic no- 
tions of theoretical and sampling distributions and 
the logic underlying the making of business de- 
cisions in the face of uncertainty as to their out- 
come. 


Thus both the old and the new are brought to- 
gether in a well-balanced whole which presents as 
ons stadia receptor! seammodie tae + 
important of introductory statistics used 
in modern-day Good illustrations, ex- 
amples, and many classroom tested problems, help 
to put the important ideas of modern statistics 
within the reach of the average student. 


1958 608 pages Text price $7.50 
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